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Abstract 

Background: Epigenetics refers to the reversible functional modifications of the genome that do not correlate to 
changes in the DNA sequence. The aim of this study is to understand DNA methylation patterns across different 
stages of lung adenocarcinoma (LUAD). 

Results: Our study identified 72, 93 and 170 significant DNA methylated genes in Stages I, II and III respectively. A 
set of common 34 significant DNA methylated genes located in the promoter section of the true CpG islands were 
found across stages, and these were: HOX genes, FOXG1, GRIK3, HAND2, PRKCB, etc. Of the total significant DNA 
methylated genes, 65 correlated with transcription function. The epigenetic analysis identified the following novel 
genes across all stages: PTGDR, 71X3, and POU4F2. The stage-wise analysis observed the appearance of NEUROG1 
gene in Stage I and its re-appearance in Stage III. The analysis showed similar epigenetic pattern across Stage I and 
Stage III. Pathway analysis revealed important signaling and metabolic pathways of LUAD to correlate with epigenetics. 
Epigenetic subnetwork analysis identified a set of seven conserved genes across all stages: UBC, KRAS, PIK3GA, PIK3R3, 
RAF1, BRAF, and RAP1A. A detailed literature analysis elucidated epigenetic genes like FOXG1, HLA-G, and NKX6-2 to be 
known as prognostic targets. 

Conclusion: Integrating epigenetic information for genes with expression data can be useful for comprehending 
in-depth disease mechanism and for the ultimate goal of better target identification. 
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Background 

Cancer progression is associated with mutation and differ- 
ential gene expression [1]. Many oncogenes and tumor 
suppressor genes responsible for cancer are linked to 
mutations [2]. Besides these mutations, recent studies 
correlate epigenetic features to play an important role in 
cancer development and propagation [3-10]. Epigenetics 
refers to all gene modifications except the change in the 
DNA sequence [11]. These modifications are caused by 
changes in the chromatin structure [11], DNA methyla- 
tion, and histone modifications. Nearly 50% of human 
genes are associated with CpG islands in the promoter 
regions [12]. If these promoter regions undergo methy- 
lation, they lead to disease progression [12-14]. DNA 
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methylation occurs in and out of CpG islands, which in 
a normal tissue is entirely unmethylated at all stages of 
development and allows gene expression if a transcrip- 
tion factor (TF) is present [15]. These changes affect the 
binding of transcription factors (TFs) to DNA [16]. This 
occurs by DNA methylation of the 5-CG-3' pair usually 
observed at the transcription regulation sites, which results 
in silencing or activation of the downstream genes [17]. 

Advances in next-generation technologies have led to 
identification of genome-wide DNA methylations in a 
large number of disease samples. Methylation sites have 
been analyzed based on clustering with respect to genomic 
regions, methylation patterns, and common regulatory 
patterns [16]. Increased methylation of CpG islands in 
the promoter regions known as hypermethylation leads 
to silencing of genes, usually associated with tumor 
suppressor genes [18], whereas the decreased methyla- 
tion known as hypomethylation is associated with gene 
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overexpression i.e., activation of oncogenes [18]. Both 
hypermethylation and hypomethylation are known to 
be linked to tumors, autoimmune and other diseases 
[16,19,20]. In cancers, many genes are methylated in 
normally unmethylated promoter CpG islands, eventually 
influencing transcriptional activity both in early and late 
stages [19,21,22]. Specific examples of DNA methylation 
role in cancers include hypermethylation of BRCA1 in 
breast and ovarian cancer [22,23]; DOK7 in breast cancer 
[22]; MYOD1 in hematological neoplasm; APC, HOX2, 
OTX1 genes in non-small cell lung carcinoma (NSCLC) 
[24]; FEN1 in breast tumor cells [25]; and hypomethyla- 
tion of TKTL1 in head and neck squamous cell carcinoma 
[26]. Literature provides evidence correlating transcrip- 
tional activities with methylated genes [12], suggesting 
the role of higher methylation in lowering the transcrip- 
tional activity [4]. Since these epigenetic alterations are 
reversible, identification of methylated genes for targeted 
modifications in cancer can provide a new approach to 
successful drug therapies. 

Lung cancer is one of the most commonly diagnosed 
cancers in United States. Lung cancer is morphologically 
divided into NSCLC and small cell lung cancer (SCLC) 
[27]. NSCLC is classified into three major histopatholgical 
subtypes: adenocarcinoma, squamous cell carcinoma and 
large cell carcinoma. Lung adenocarcinoma (LUAD) is 
currently the most common of the lung cancers in both 
smokers and non-smokers. LUAD is classified into four 
stages: Stage I, when the cancer is localized; Stage II, when 
the cancer has spread to the lymph nodes; Stage III, when 
the cancer has spread to tissues near the lungs; and, Stage 
IV, when metastasis has occurred [28] . Few DNA methyla- 
tion studies have been reported for NSCLC, and the DNA 
methylated genes identified in these were APC, CDH13, 
CDKN2A, DAPK, hMLHl, HOX, OTX1, HOX2, ZIC4, 
and RASSF1 [24,27,29,30]. There have been no stage- 
wise methylation studies reported on LUAD. It has been 
observed that LUAD is highly heterogeneous, and there 
is less similarity between stages and across the samples 
within the stages [24], therefore understanding DNA 
methylated genes profile across LUAD can provide a new 
insight. 

The aim of the study was to elucidate the DNA methyla- 
tion patterns across different stages of LUAD from pub- 
licly available data resources. We used The Cancer 
Genome Atlas Data (TCGA) as our resource for methyla- 
tion data. In this study, a systems biology approach of inte- 
grating gene-expression, DNA methylation and protein- 
protein interaction data for finding highly important DNA 
methylated genes across stages of LUAD were developed. 
These DNA methylated genes were compared across 
stages for their uniqueness and commonality to identify 
the patterns across stages of LUAD. These patterns were 
then validated and ranked for their importance in LUAD 



using literature evidences [3,31]. These ranked patterns 
were analyzed as potential targets of LUAD. The limita- 
tion of the study was the laboratory validation of the 
targets and availability of datasets in the TCGA. To our 
knowledge this is the first study that explains the DNA 
methylated genes across stages of LUAD. 

Results 

The objective of this study was to understand the Sig- 
nificant DNA methylated genes across the four stages 
of LUAD and analyze them as potential targets. The 
TCGA data associated with LUAD was classified based 
on these stages. For this stage-wise data, the patients 
age ranged from 58-75 with few outliers. The Signifi- 
cant expressed genes and Significant DNA methylated 
genes were identified based on the p-values and beta- 
values for each stage as described in the methodology. 
Resampling technique were performed for the correction 
and these provided the set of p-values. Using the tech- 
nique used in paper [32], p-value of 0.0012 was obtained 
from q-values. Using this cutoff the Significant DNA 
methylated genes were re-evaluated and overlap between 
the previous and resampled results were calculated. A 
substantial amount of overlap between old and new set of 
Significant DNA methylated genes were observed. Add- 
itional file 1 shows the p-value correction for original and 
corrected Stage I data after resampling. The Significant 
DNA methylated genes were then further classified as 
hypermethylated and hypomethylated (methodology 
section). Table 1 lists the statistics for each stage. 

The Significant DNA methylated genes were compared 
across stages as shown in Figure 1A. From the Venn dia- 
grams of Figure 1A it can be seen that the maximum 
number of Significant DNA methylated genes were iden- 
tified in Stage III and minimum in Stage I. 

As shown in Figure 1A, there were 34 common Signifi- 
cant DNA methylated genes across all stages. Additionally 
46 common Significant DNA methylated genes were 
identified between Stages I and III, and 64 were identi- 
fied between Stages II and III. Figure IB and 1C shows 
the distribution of hypermethylated and hypomethy- 
lated genes across stages. Of the 46 common Significant 
DNA methylated genes between Stages I and III, 42 were 

Table 1 Distribution of significant genes and significant 
DNA methylated genes across the four stages of LUAD 



Stage Number of Number of Significant DNA methylated 



normal 


disease 


genes 


genes 


samples 


samples 




Hyper 


Hypo 


1 2 


9 


15994 


67 


5 


II 7 


14 


16275 


20 


73 


III 5 


11 


14688 


110 


60 


IV 2 


6 


14814 


0 


0 
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was hypermethylated in both stages. On comparing with 
Stage II, it was observed 36 hypermethylated genes in 
Stage I were hypomethylated in Stage II and 25 of these 
were then hypermethylated in Stage III. From Figure 1C 
it can be stated that maximum number of hypomethylated 
genes were identified in Stage II. Similar to hypermethy- 
lated genes, no genes maintained the same hypomethyla- 
tion profile across the three stages. Of the four common 
hypomethylated genes between Stages I and III, two were 
identified as hypermethylated in Stage II. This suggests 
that genes in Stage II may have different patterns from 
those of Stages I and III. Table 2 lists the common Sig- 
nificant DNA methylated genes across and between stages. 

Identification of highly scored Significant DNA 
methylated genes 

The significant DNA methylated genes were analyzed and 
ranked based on their beta- values. Table 3 lists the top 10 
hyper/hypomethylated genes across stages in descending 
order of their beta-values. As shown in Table 3, ten of the 
top Significant DNA methylated genes in Stage I was 
Stage common across the three (Table 2). Of these 10, 
seven: A] API, ATP8A2, HOXA9, PTGDR, SIX6, TLX3, 

Table 2 Common DNA methylated genes across stages 

Stages 



TMEM130 were hypermethylated, and the three: KRTAP8-1, 
MMP26 and REG3A were hypomethylated. Three of the 
seven (Stage I) genes: A] API, TLX3, PTGDR were also 
identified in Stage III. Interestingly the three top scored 
hypomethylated genes in Stage I was identified as top 
scored hypermethylated in Stage II. In addition, some of 
the top scored DNA methylated genes were common 
across two stages only (Table 2): LY96 was the top scored 
hypomethylated gene and top scored hypermethylated in 
Stage I and II respectively. While HOXA4, HOXD10, 
KRTAP15-7, LEP, and NKX6-2 were identified as common 
across Stage II and III (Table 2). Table 3 also identified 
unique top scored Significant DNA methylated genes. 
Tables 2 and 3 have large number of Significant DNA 
methylated genes common among them. 

Significant DNA methylated genes in and outside of CpG 
islands, promoter regions, transcription factors, 
chromosomes and pathways 

The hypermethylated and hypomethylated genes were 
further analyzed with respect to their methylation inside 
and outside of the CpG islands. Table 4 gives the profile 
of this distribution. From this table, hypermethylated 



DNA methylated genes 



Number 



List of genes 



Common DNA methylated genes across 
the three stages 


34 


AJAP1, ATP8A2, CCDC140, CNTP2, CYYR1, EVX1, FERD3L, FOXG1, GRIK3, GRM6, HAND2, HOXA9, 
HOXB4, HOXD4, HOXD9, HOXD12, INPP5B, OTX2, KRTAP8-1, MMP26, PHOX2A, PLEKHA6, POU4F2, 
PRAQ PRKCB, PTGDR, REG3A, SIX6, SLC6A2, SPAG6, TBX20, TLX3, TMEM130, ZNF560 


Common DNA methylated genes across 
Stage 1 & II 


12 


ADCY4, BHMT, C12orf34, CD01, LVRN, LY96, MSQ PCDHGA 12, POU3F3, ZNF154, ZNF577, IHH 


Common DNA methylated genes across 
Stage II & III 


30 


BARHL2, C10orf81, CCDC140, DEFB1 19, DI03, FAM135B, FAM83A, GRIK2, HOXA4, HOXD10, HS3ST2, 
KCNS2, KRTAP15- 1, LEP, LHX1, LYPD5, MAGEB6, NEUROG1, NKX6-2, OR5I1, SERPINB5, SPHKAP, TALI, 
TBX4, TBX5, TCN1, TMEM132D, VSX1, ZNF454, IGKV7-3 


Common hypermethylated genes across 
Stage 1 and Stage III 


42 


AJAP1, ATP8A2, CCDC140, CNTP2, CYYR1, EVX1, FERD3L, FOXG1, FOXI2, GALR1, GAS7, GRIK2, GRM6, 
HAND2, HLA-G, HOXA7, HOXA9, HOXB4, HOXD4, HOXD8, HOXD9, HOXD12, INPP5B, NID2, NPY, OTX2, 
PAX7, PHOX2A, PLEKHA6, POU4F2, PRAQ PRKCB, PTGDR, SIX6, SLC6A2, SOX17, SPAG6, TBX20, TLX3, 
TMEM130,VIPR2,ZNF560 


Common hypomethylated genes across 
Stage 1 and Stage III 


4 


COR06, MMP26, REG3A, KRTAP8-1 



Pradhan et al. BMC Systems Biology 2013, 7:141 
http://www.biomedcentral.eom/1 752-0509/7/1 41 



Page 4 of 21 



Table 3 Identification of top beta-value scored DNA methylated genes across stages 



Stage Hyper/Hypo Genes in descending order of beta-values (p <0.001) 



1 Hyper 


TLX3 > NEFM > PTGDR > AJAP1 > SIX6 > H0XA9 > TMEM130 > HISTIH3G > ATP8A2 > NID2 


Hypo 


MMP26 > KRTAP 8- 1 > REG3A > C0R06 > LY96 


II Hyper 


LY96 > C10orf81 > KRTAP8-1 > MMP26 > REG3A > DEFB 1 19 > NMUR2 > MAGEB6 > IGKV7-3 < KRTAP15-1 


Hypo 


HTR2C > GRIK3 > CNTP2 > SPHKAP > TMEM132D > NEFH > LEP > ZNF177 > HOXD 10>NKX 6-2 


III Hyper 


TLX3 > H0XB4 > AJAP1 > H0XA4 > H0XD9 > PTGDR > LYPD5 >FZD10> HOXD 12 > NECAB2 


Hypo 


CHR6 > C13orf28 > TMEM156 > XDH > FGF6 > IVL > G6PC > KRTAP8-1 > C10orf39 > FCRL3 



genes in Stages I and III and hypomethylated genes in 
Stage II were mostly identified in TRUE CpG sites. Of 
the 34 common Significant DNA methylated genes across 
all stages (see Table 2), 25 were identified in TRUE CpG 
sites: AJAP1, ATP8A2, CYYR1, EVX1, FERD3L, GRIK3, 
GRM6, HAND2, HOXA9, HOXB4, HOXD9, HOXD4, 
HOXD12, OTX2, PRAC, PHOX2A, POU4F2, PTGDR, 
SIX6, SLC6A2, SPAG6, TBX20, TMEM130, TLX3, and 
ZNF560. These common genes were hypermethylated in 
Stages I and III respectively but hypomethylated in Stage 
II. Additionally, nine hypermethylated genes common to 
Stages I and III (see Table 2) were identified in TRUE 
CpG sites: GALR1, HLA-G, HOXA7, HOXD8, NID2, NPY, 
PAX7, SOX17, and VIPR2. The hypomethylated genes 
which were common to Stages I and II, REG3A, MMP26 
and KRTAP8-1 (see Table 2) were also identified in TRUE 
CpG sites. The CpG sites were analyzed for their role 
as promoter sites. This analysis identified 61/72, 80/93 
and 141/170 Significant DNA methylated genes across 
Stage I, II and III respectively in promoter sites. Also, 
the common Significant DNA methylated genes across 
all stages and between two stages (see Table 2) were 
identified in the promoter sites. Methylation of promoter 
regions in the gene correlate with low or no transcription 
[33]. Gene Ontology was used to correlate the transcrip- 
tion role of the Significant DNA methylated genes identi- 
fied in all of the three stages. Recall from Table 1 that 
there are 72, 93 and 170 Significant DNA methylated 
genes found in Stages I, II and III respectively of these 65 
were identified as TFs. Among the 34 common genes 
across stages identified in Table 2, 16 were identified as 
TFs. These 16 TFs were found to be hypermethylated in 
Stages I and III, and hypomethylated in Stage II. Figure 2 
describes the TF distribution profile across the different 
stages. Analysis of these TFs with respect to their CpG 



Table 4 Distribution of hyper and hypo-methylated genes 
in CpG islands 



Stage 


Hypermethylated 


Hypomethylated 


True 


False 


True False 


1 


53 


4 


1 4 


II 


4 


14 


60 4 


III 


88 


7 


4 45 



sites also identified all (except HBE1, HOXD 10, ORS1) 
mapped to the TRUE CpG sites. 

Figure 3A and 3B show the chromosome profile of 
Significant DNA methylated genes with respect to their 
hyper and hypomethylation. As shown in Figure 3A for 
Stage I the maximum number of hypermethylated genes 
were present on chromosome 7 while no hypermethylated 
genes were present on chromosomes 3, 9, 15 and 16. The 
Stage I genes identified on chromosome 7: EVX1, FERD3L, 
HOXA7, HOXA9, NPY, TBX20 and TMEM130 were 
common in Stages I and III; five of these were common 
across all stages (see Table 2). For Stage II, the max- 
imum number of hypermethylated genes was present on 
chromosome 8: MX2, KRTAP8-1 and KRTAP 15-1. Of 
these three, KRTAP8-1 was common across all the 
stages, and KRTAP15-1 common across Stages II and III 
(see Table 2). In Stage III, all chromosomes had atleast 
one gene identified as hypermethylated. As with Stage I, 
in Stage III also the maximum number of hypermethy- 
lated genes was identified on chromosome 7: CFTR, 
DGKI, EPO, EVX1, FERD3L, HOXA4, HOXA7, HOXA9, 
HOXA13, LEP, NPTX2, NPY, TBX20, TMEM130 and 
VIPR2. Of these genes five were common across all 
stages, three were common to Stages I and III; two were 
common to Stages II and III; and two in only Stage III 
(see Table 2). 

Figure 3B shows Stage II has a maximum number of 
hypomethylated genes on chromosome 7 also no hypo- 
methylated genes were identified on chromosomes 18 and 
19 for this stage. For Stage I, only one hypomethylated 
gene was identified in chromosomes 2, 8, 11, 17, 21. For 
Stage III, maximum number of hypomethylated genes 
was present on chromosome 1: CD IB, FCRL3, FLG, 
IVL, LCE1F, LCE2B, LCE3D, LCE4A, SPTA1 and ZP4. 
Also, these 10 hypomethylated genes on chromosome 1 
were unique to Stage III. 

Additional file 2 shows the pathway distribution of 
Significant DNA methylated genes across stages. This 
distribution depicts difference and commonality across 
stages in terms of pathways. The pathways associated with 
common Significant DNA methylated genes given in Table 2 
across all stages were Inositol phosphate metabolism, 
Neuroactive ligand-receptor interaction, Phosphatidylinositol 
signaling system and P53 signaling. The pathways associated 
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Figure 2 Profile of DNA methylated genes as transcription factors across stages. 



with common Significant DNA methylated genes (Table 2) 
across Stages I and II were Glycine, serine and threonine 
metabolism, Melanogenesis, Taurine and hypotaurine 
metabolism, P.E.coli infection, Shigellosis, Toll-like recep- 
tor, calcium signaling pathway and GnRH signaling path- 
way. The pathways associated with common Significant 
DNA methylated genes (Table 2) across Stages II, and III 
was Adipocytokine signaling pathway, Cytokine- cytokine 
receptor interaction, JAK-STAT signaling pathway, P53 
signaling pathway. The pathways associated with com- 
mon Significant DNA methylated genes across (Table 2) 
Stages I and III were Antigen processing, Cell adhesion 
molecules, Natural killer cell mediated cytotoxicity, Type I 
diabetes mellitus. In addition, focal adhesion pathway 
was associated with Stage II genes and Hedgehog path- 
way with Stage III genes. 



Network construction and analysis 

A systems biology approach was developed to understand 
the Significant DNA methylated genes and Significant 
expressed genes in each stage. The interactions for the Sig- 
nificant DNA methylated genes and Significant expressed 
genes were identified using BioGRID [34] and stage-specific 
networks were constructed. Table 5 shows the number of 
interactions of Significant DNA methylated genes across 
the three stages. In each stage, the interactions of Sig- 
nificant DNA methylated genes were analyzed with respect 
to Significant expressed genes. This analysis showed that 
Significant DNA methylated genes have interactions among 
themselves, with Significant expressed genes and additional 
genes present in BioGRID [34]. These additional genes 
were analyzed for their expression in all the stages to 
determine if DNA methylation affected their expression. 
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Table 5 DNA methylated gene interactions across stages 



Stage 


Num. of DNA methylated genes 


Num. of interactions 


1 


72 


228 


II 


93 


273 


III 


170 


660 



These interactions were termed as "missing links", and 
the additional genes as "novel genes". Table 6 gives the 
profile of the missing links and novel genes. Analysis of 
the 27 novel genes in Stage I for their significance in 
other stages indicated six of them in Stage II: ANXA7, 
APBB1IP, MDK, PFDN1, TINF2, TLE2; three in Stage III: 
CUL5, CTNNB1 and SQSTM1 and six in Stages II and III: 
CALM1, CTNNB1, c-JUN, SMAD1, TINF2. Of the 33 
novel genes in Stage II, two were associated in Stage III: 
A2M and CTNNBll; and ten genes in Stages I and III: 
FOXA2, HK3, NCF1, NRIP1, PDLIM1, SP1, SUMOl, 
TCF4, TLR4, and TNN. Analyses of the 83 novel genes 
in Stage III found three in Stage I: ELN, FAS and TEX11; 
seven in Stage II: ANXA7, APBB1IP, MDK, PFDN1, 
STAT3, TLE2, UBE2B and 34 in Stages I and IhBCR, 
DLG3m, DLG4, EGFR, DSP, MAFF, PICK1 etc. Table 7 
shows the profile of interaction of Significant DNA 
methylated genes identified in Table 2 and Table 3 with 
novel genes analyzed in this paragraph (given in Table 6). 

Figure 4 shows the stage-specific networks of Signifi- 
cant DNA methylated genes. From this figure, it can be 
seen that Stage III networks were more connected and 
dense as compared to other two stage networks. This 
suggests heterogeneity of LUAD network across stages. 
To compare stage-specific networks, subnetworks of 
Significant DNA methylated genes were identified and 
analyzed. SEED and expand algorithm (described in 
methodology) was used to identify these subnetworks. 
Additional file 3 lists the number of subnetworks with 
respect to the pathway class. These subnetworks were 
overlapping as the genes in them belonged to different 
pathway class. Additional file 3 shows that the number 
of subnetworks drastically increases from size four to 
size five in most of the stages, making it an NP-hard 
problem. This sharp increase in the number of subnet- 
work suggests that though the DNA methylated gene is 
not directly connected to a hub node, its interaction 
path has a hub node. This further indicates that a DNA 
methylated gene can influence the whole network of a 
given stage. Table 8 lists the subnetworks with greater 



number of connections identified in all three stages. As 
shown in Table 8, UBC and CUL1 were identified as 
hub gene across the three stages and their connectivity 
profile changes with pathway class. The other hub genes 
(number of connections) identified in Stages I and III 
(not shown in the Table 8) were: SIRT7 (6), CDK2 (5), 
PMS2 (4), SUM02 (3), SMAD3 (7), SMAD4 (5), and 
SMAD2 (4). The analysis also identified LY96 subnet- 
work in Stage I consisting of the hub gene TLR4 inter- 
acting with seven other genes. Though LY96 was also 
identified in Stage II, the comparative subnetwork was 
smaller, and this gene was not identified at all in Stage 
III. HLA-G was present in Stage I but not in Stage II; there- 
fore its subnetworks were missing. In Stage II and III, c-Jun 
a TF was identified as a hub gene. PHOX2A was the Sig- 
nificant DNA methylated gene associated with c-Jun in 
both stages. There was similarity across common genes 
(see Table 2) with table 8, depicting that subnetworks 
constructed out of common genes across or between 
two stages can be of significance to LUAD. The size four 
subnetworks were further compared across the stages to 
understand their commonality and uniqueness (Add- 
itional file 4). This size four subnetworks were ana- 
lyzed for their common Significant DNA methylated 
genes. The common Significant DNA methylated genes 
in this size four subnetworks were FOXG1, and PHOX2A 
(see in Table 2 also) and significant expressed genes were: 
FOXH1, FOX03, HAND2, MYC, RBI, SMAD2, SMAD3, 
SMAD4, and TP53. On analysis of genes in these sub- 
networks with respect to their pathway classes found 
some of them to be very specific to a given pathway class. 
A highly conserved common subnetwork of GRIK2, 
GRIK3, GRIK5, and GRID2 was identified across all 
stages belonging to the other pathway class. Of these 
GRIK3 was Significant DNA methylated in all the three 
stages (Table 2) and GRIK2 in Stages I and III (Table 2). 

Analysis of these subnetworks is an NP-hard problem 
because these are large open subnetworks. To reduce 
the complexity, the subnetworks were ranked based on 
their NodeStrength and EdgeStrength as given in meth- 
odology section. The top ranked, size four subnetworks 
of each stage (Table 8 and Additional file 3) were propa- 
gated and compared to identify the largest conserved 
subnetworks across the stages. This analysis identified a 
subnetwork of size 11 with seven conserved genes: UBC, 
KRAS, PIK3CA, PIK3R3, RAF1, BRAF, RAP1A (Additional 
file 5). The g: Profiler tool was used for the enrichment 



Table 6 Novel genes (Missing Link-methodology) discovered using BioGRID 

Stage Missing links Novel genes Num. of DNA methylated genes Num. of novel genes & stages where these are identified 

I 27 27 16 6 (Stage II, III), 6 (Stage ll),3 (Stage III) 

II 43 33 25 10 (Stage I, III), 2 (Stage III) 

III 1 32 83 32 34 (Stage I, II), 3 (Stage I), 7 (Stage II) 
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Table 7 Analysis of DNA methylated genes interacting 



with novel genes 

DNA methylated genes Stages 

I II III 

Common DNA methylated genes 

AJAP1 V V V 

F0XG1 V V V 

GRIK3 V V 

HAND2 V 
H0XD4 V V V 

PH0X2A V 

PRKCB V V V 

TLX3 V 

DNA methylated genes common with Stage I & II 

LY96 V V 

MSC V 

Hypermethylated genes common with Stage I & III 

GAS7 V V 

GRIK3 V 

HLA-G V 
H0XA9 V 
H0XD8 

H0XD12 V 
NID2 

TLX3 V 
DNA methylated genes common with Stage II & III 

SERPINB5 V V 

TALI V 
Top scored DNA methylated gene 

IVL V 
NEFM V 

TLX3 V V 



analysis on the top ranked subnetwork given in Table 9 
[35,36]. This analysis showed that these subnetworks to 
be enriched with common genes across stages (shown 
in Table 2), indicating that commonality across stages 
of LUAD can be critical in identifying the target genes. 
Figure 5 gives the Circos image of the number of hypo- 
methylated and hypermethylated genes and pathways class 
across chromosomes for each stages of LUAD. 

Figure 6 and Table 10 show the comparison results 
for the interaction networks from BioGRID and from 
the manually curated signaling network. As shown in 
the table and Venn diagram, there is a minimal propor- 
tion of overlap between the methylation and expression 
networks obtained from multiple sources (see Figure 6(A) 
& (B)). However, this overlap improves after applying 



the methylation-expression network integration criteria 
mentioned in methodology section (see Figure 6(C)). In 
addition, there is a significant amount of commonality in 
the subnetworks extracted from networks for BioGRID 
and the manually curated signaling network. The same set 
of conserved genes (KRAS, PIK3CA, PIK3R3, RAF1, BRAF, 
and RAP1A) was obtained in these networks except for 
UBC, which is missing from the resulting subnetwork of 
signaling network. 

Discussion 

Distribution and understanding of Significant DNA 
methylated genes across stages 

According to Table 1 and Figure 1, the maximum num- 
ber of Significant DNA methylated genes were identified 
for Stage III followed by Stage II and then Stage I. None 
of the genes in Stage IV met the filtering criteria; there- 
fore, no genes were identified as DNA methylated. From 
Table 1, it can be seen that hypermethylated genes were 
more prevalent in Stages I and III than in Stage II. 
Though this study identified 34 common Significant DNA 
methylated genes (see Table 2) across the three stages, 
most of them have not been reported previously in LUAD. 
The HOX genes that were common across the three 
stages are grouped into four HOX families, A, B, C, 
and D; equivalent numbered HOX genes (HOXA9, HOXB9) 
in each family groups (A, B, C, D) are paralogues. The 
analysis found HOXA4, HOXA9, HOXB4, HOXD9, and 
HOXD12 genes with high methylation value, suggesting 
these genes play an important role in all stages of 
LUAD. These genes are known to be involved in cell 
proliferation while preventing apoptosis and helping in 
survival [37]. Dysregulated behavior of HOX genes has 
been observed in ovarian cancer [38]. Early stage HOXA9 
methylation has been identified in lung cancer and used 
in early detection and prognosis [39,40]. Our analysis 
found HOX genes in all stages, with hypermethylation 
in Stages I and III, hypomethylation in Stage II. While 
no previous studies have associated the profile of HOX 
genes with stages, though re-appearance was identified 
and our analysis demonstrated this aspect. Another gene 
identified by our analysis across all three stages was PTGDR, 
which was highly hypermethylated in Stages I and III 
(Table 3). PTGDR has been negatively correlated with 
smoking [41] and methylated in colon cancer [42], how- 
ever, prior studies have not investigated its role in LUAD. 
POU4F2 and TLX3 were identified in all three stages, 
and TLX3 was highly methylated in Stages I and III 
(Table 3). Previous studies have found them as methyl- 
ated in leukemia and breast cancer respectively [43,44] 
but not in LUAD. Overexpression of TBX20, which was 
also identified in this study (see Table 3), has been re- 
ported in lung cancer [45]. EVX1 and OTX2 (see Table 2) 
were identified as methylated in NSCLC and lung 
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Figure 4 Stage-wise interactions of DNA-methylated genes with significant genes. 



cancer [46,47]. MMP26 has been associated with tumor 
development, invasion and metastasis of NSCLC but its 
methylation profile was not reported [48], our analysis 
showed it to be highly hypomethylated in Stage II 
(Table 3). There was no literature evidence about 
KPTAP8-1, REG2A, and SLX6 for their significance or 
methylation in lung cancer. 

Of the 12 common Significant DNA methylated genes 
common to Stages I and II, LY96 has been previously as- 
sociated with lung cancer [49]; ZNF577 and LVRN have 
been identified as methylated in lung cancer [47] and 
renal carcinoma, but not in LUAD [50]. LY96 was highly 
hypomethylated in Stage I and hypermethylated in Stage 
II (as shown in Table 3), suggesting further investigation 
into its role in LUAD. 

Of the 30 common Significant DNA methylated genes 
across Stages II and III, GRIK2 and NEUROG1 have 



been previously reported being DNA methylated bio- 
marker for lung squamous cell carcinoma [47], and a 
Stage I biomarker in lung cancer [46] respectively. How- 
ever, re-appearance of NEUROG1 in Stage III has not 
been previously reported. SERPINB5 and TALI have 
been identified as methylated in NSCLC [30,46,51]. LEP 
has been reported as biomarkers in breast cancer [52], 
though not in lung cancer. The other highly hyper- 
methylated genes across Stages II and III identified in 
this study (as shown in Table 3) were AJAP1, HOXB4, 
MMP26, NMUR2, REG3A, TLX3, etc. and hypomethy- 
lated genes were FCRL3, GRIK3, HTR2C, IVL, NKX6-2, 
etc. Literature validation of these genes with respect to 
their importance in LUAD and other cancers found 
NMUR2 to be overexpressed in pancreatic cancer [53], 
A] API epigenetically silenced in Glioblastoma [54]. Also, 
AJAP1 was correlated with susceptibility in lung cancer 
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Table 8 Analysis of hub genes in the DNA methylated subnetworks of size 4 

Stage Subnetwork Connectivity profile of the hub node across pathways 



Cancer Lung cancer Signaling Metabolic + others 



1 (i) PHOX2A*:HAND2*:PPP2R5D:UBC 


23 


13 


71 


288 


(ii) HLA-G*:COPB1:TRIM37:UBC 










(Hi) LY96*:TLR4:SIGIRR:UBC 










(iv) HLA-G*:C0PB1:UBC:CUL1 


21 


14 


35 


114 


(v) F0XG1*:F0XH1:SMAD3:CUL1 










(vi) HLA-G*:C0PB1:UBC:SKP2 


19 


10 


17 


38 


II (i) PHOX2A*:c-JUN:SUM03:UBC 


23 


14 


74 


254 


(ii) TAL1*:HDAC1:IRF5:UBC 










(Hi) PRKCB*:HIST1H3I:CUL4A:CUL1 


20 


14 


74 


253 


1 1 1 (/) PH0X2A *:c-JUN:SUM03:UBC 


22 


10 


64 


235 


(ii) PHOX2A*:HAND2*:PPP2R5D:UBC 










(Hi) PRKCB*:HIST1H3I:CUL4A:CUL1 


20 


13 


35 


102 



*: identified in Table 2. 



[55]. GRIK3 was correlated with breast cancer and being 
considered as diagnostic for lung cancer [56]. Not much 
literature evidence has been reported for the presence of 
VSX1, NKX 6-2 in cancer or their methylation. 

Of the 42 common Significant hypermethylated genes 
unique to Stages I and III, GALR1, NID2 have been 
identified as highly methylated in NSCLC [46,55], PAX7 
has been identified in lung cancer but not reported with 
methylation [57], though PAX family genes have been 
previously reported being methylated in cancer [58]. Re- 
cent studies have reported SOX17 methylation in lung 
cancer [59], but not at the stage level. Additionally low 
expression of GAS7 has been reported in lung squamous 
cell carcinoma suggesting its importance as marker [60], 



but this gene has not been previously found to be meth- 
ylated in LUAD but reported as methylated in colorectal 
cancer [61]. 

Of the four Significant hypomethylated genes in Stage 
I and III, in addition to KRATAP8-1 and MMP26 which 
were analyzed in the above section, COR06 was also 
hypomethylated and has been reported as an epigenetic 
gene in renal cell carcinoma but not in LUAD [62]. 
However, not much detail for REG3A with regards to its 
presence in LUAD was available. 

In addition to genes that were found to be common 
across all or two stages, several genes were unique to 
one stage only, and these need further evaluation. From 
Stage I, NEFM has been reported as a biomarker and as 



Table 9 Enrichment analysis of the top scored subnetworks 



Stage Biological Process p-value Genes in subnetworks 

Common across Activation of MAPKK activity 1 .1 2E-03 PH0X2A*, HAND2* PPP2R5D, UBC, KRAS, PIK3CA, PIK3R3, 

all Stages RAF1, BRAF, RAP1A 

I & II Co-SMAD binding 1 .2E-05 FOXG1* FOXH1, SMAD2, SMAD1, MED15, UBC, KRAS, PIK3CA, PIK3R3, 

RAF1, BRAF, RAP1A 

II & III Nerve growth factor receptor signaling pathways 2.21 E-04 HOXD4*, INPP5B, SLC6A2, STX1A, VAMP1, UBC, KRAS, PIK3CA, PIK3R3, 

RAF1, BRAF, RAPIA 

I & III Positive regulation of peptidyl-serine phosphorylation 1 .42E-03 NPY* NPY1R, LSM7, NR1H2, RMI1, UBC, KRAS, PIK3CA, PIK3R3, 

RAF1, BRAF 

I Transmembrane receptor protein tyrosine kinase 1 .87E-03 HLA-G* COPB1, UBC, KRAS, PIK3CA, PIK3R3, RAF1, BRAF, RAPIA 

signaling pathway 

II DNA helicase complex 6.8E-05 SERPINB5*, UCHL5, ACTR8, ACTR5, UBC, KRAS, PIK3CA, PIK3R3, 

RAF1, BRAF 

III Nerve growth factor receptor signaling pathway 5.5E-04 HOXB4*, CREBBP, KLF13, UBC, KRAS, PIK3CA, PIK3R3, RAF1, 

BRAF, RAPIA 



*: commonality with Table 2. 
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Figure 5 Circos image showing the stage-wise distribution of hypermethylated, hypomethylated genes and pathway class in the 
chromosomes. Outer circle represents the chromosomes, the first inner circle represents the hypermethylated genes in each chromosome, 
and the second inner circle represents the hypomethylated genes on each chromosome and inside is the pathway class to which the gene on 
the respective chromosome belongs. 



a methylated gene in cancer [63,64], however, it was not 
reported in LUAD. From Stage III, IVL has been identi- 
fied as overexpressed in cancer [65] but not in LUAD. 
Overall our methodology identified both known and 
novel DNA methylated genes that were significant across 
all three stages of LUAD. Also, our analysis found that 
most of the DNA methylated genes that were common 
across all stages were highly methylated in the respective 
stages (Table 2 and Table 3), and many of them were re- 
ported as oncogenes. 

Distribution of the Significant DNA methylated genes in 
and outside of CpG islands, promoter regions, 
transcription factors, chromosomes and pathways 

Our initial distribution analysis found that most of the 
Significant DNA methylated genes across the three 
stages were present in the TRUE CpG sites (as shown 
in Table 4), stating the role of these sites in methylation. 
In Stage III, 45 hypomethylated genes were present in 
FALSE CpG islands. The false sites could be further vali- 
dated using other databases or clinical features. The ana- 
lytic procedure in this analysis identified 30 of the 34 



common Significant DNA methylated genes (found in 
Table 2) and 79 unique hypermethylated Stage III genes 
in the promoter regions. This correlation of the promoter 
region with common Significant DNA methylated genes 
further demonstrates that the genes with higher CpG 
islands in the promoter region were methylated across 
the stages of LUAD. Therefore, further analysis can be 
done to better understand these promoter regions func- 
tionally with respect to their conservation (motifs) as 
these can be co-regulated. 

DNA methylation is closely linked with gene regula- 
tion, particularly with transcriptional activity. It has been 
reported that DNA methylation can prevent gene activa- 
tion and restrict expression for correct developmental 
stage [66]. It can also interfere with binding of TFs by 
changing the recognition sites involving cytosine [67]. 
Most TFs require CpG-rich sites to bind to DNA and 
methylation of these sites might interfere with the bind- 
ings. This study identified TFs in all three stages and 
these TFs were associated with TRUE CpG sites. As TFs 
have been identified as potential biomarkers for different 
diseases, the unique TFs identified for each stage were 
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(A) Methylation network overlap 

bio_stage1 sig_stage1 bio_stage2 sig_stage2 bio_stage3 sig_stage3 




(B) Expression network overlap 

bio_stage1 sig_stage1 bio_stage2 sig_stage2 bio__stage3 sig_stage3 




(C) Expression-Methylation network merge overlap 

Figure 6 Comparison of different types of networks obtained from BioGRID and manually curated signaling network. The "bio_stage" 
refers to BioGRID and "sig_stage" refers to signaling network. The Venn diagram depicts the overlap of genes between the two networks 
obtained from the two different sources, BioGRID and Signaling network. A) Venn diagram of the methylated genes for Stage I, Stage II, and 
Stage III. (B) Venn diagram of expression genes for Stage I, Stage II, and Stage III. (C) Venn diagram of methylated-expression genes for Stage I, 
Stage II, and Stage III. 



analyzed for their significance in LUAD using the litera- 
ture. Of the 16 TFs common across the stages, four 
(EVX1, HOXA9, OTX2, TLX3) have already been discussed 
in earlier sections. Table 11 lists the significance of the 
remaining few common and unique TFs across the stages 
with respect to their association in lung cancer, other 
cancer (not lung) and/or prognostic value. From this 

Table 10 Percentage of genes overlapping from signaling 
network with BioGRID network 





Stage 1 

(%) 


Stage 2 

(%) 


Stage 3 

(%) 


Methylation network 


17.1 


19.1 


20.2 


Expression network 


49.45 


27.32 


40.75 


Expression-Methylation merged network 


85.9 


85.75 


84.60 



table, it can be seen that almost all the TFs are consid- 
ered as prognostic markers for lung cancer. Our study 
identified these TFs as epigenetically modified across 
the stages of LUAD, and, given their significance in can- 
cers (other or lung), they could be considered for future 
studies as potential targets for LUAD. 

Earlier studies have reported chromosome 6 and 15 to 
play an important role in lung cancer [84,85] also certain 
chromosomal regions were more hypermethylated [15]. 
On mapping, the hypermethylated and hypomethylated 
genes to their respective chromosomes, it was observed 
that some chromosomes were common across all the 
three stages (Figure 3 A and Figure 3B). Chromosome 7 
was identified in all three stages, with the maximum 
number of hypermethylated genes in Stages I and III. Six 
common Significant DNA methylated genes of Table 2 
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Table 1 1 Significance of transcription factors in LUAD 


Stage/DNA methylated genes identified as TFs 


Significance 


Common across Stages (Table 2) 




rUAb 1 


Cell adhesion, Growth and invasion of lung cancer [68] and is prognostic marker in bladder cancer [69] 


UA NIDI 


Identified in early stage in squamous cell carcinoma but not in adenocarcinoma [70] 


HUadh 


Overexpression identified in ovarian cancer though HOX genes are reported in lung cancer [71] 


PH0X2A 


Abnormal methylation in NSCLC [72] 


Unique TFs in Stage 1 (total 6) 




HLA-G 


Potential biomarker in lung cancer [73] 


BCL11B 


A new therapeutic for T-cell malignancies but direct correlation with lung cancer not 




available [74,75] 


UTF1 


Not yet correlated in lung cancer but a prognostic in cervical cancer [76] 


Unique TFs in Stage II (total 6) 




EMX2 


Is associated with WNT signaling pathway and its down-regulation is associated with methylation of 




promoter region in lung cancer suggesting it as novel suppressor gene for human lung cancer [77] 


NKX6-2 


Identified as methylated in lung cancer [40] and identified as target in Pancreatic cancer [78] 


0LIG2 


Identified in lung cancer but not study as target [79] 


ZNF577 


Identified as methylated in various cancers like breast and oropharyngeal squamous cell carcinoma, 




very recently in lung cancer [80]. Not much is reported with respect to its prognostic value. 


Unique TFs in Stage III (total 13) 




EPO 


Is the key regulator in the production of RBC, methylation of the promoter section of EPO is 




identified in many cancers including lung, breast liver etc. [81] though its prognostic value 




efficiency is not reported. 


GERM1 


Hypermethylation of GR EMI is identified for prognostic significance in renal cell carcinoma [82], 




so far not in lung cancer. 


IRXl 


Methylation is identified in lung cancer [83] but needs to be studied for prognostic markers. 



were identified on chromosome 7 including the HOX 
genes which cluster on chromosomes 2, 7, 12 and 17. 
Our analysis found chromosome 7 to be highly epige- 
netically modified. Some of the other methylated genes 
located on this chromosome found in the Stages I and 
III were EVX1, FERD3L, NPY, TBX20, and VIPR2. NPY 
was found to be highly expressed in prostate carcinomas 
[86] while the significance of the others genes was dis- 
cussed in the previous section. Another Significant DNA 
methylated gene common across Stages II and III (Table 2) 
identified on chromosome 7 was LEP. This gene is known 
to be associated with advanced lung cancer (http://www. 
ncbi.nlm.nih.gov/gene/3952). Genes on chromosome 7 
have also been reported to associate with different can- 
cers including gastric cancer, and prostate cancer [87,88]. 
Chromosome 7 genes AKT and PTEN are used as prog- 
nostic markers for NSCLC [89], suggesting that chromo- 
some 7 genes that have been identified across stages of 
LUAD as methylated can be considered for prognostic 
significance in LUAD. Our analysis also found Chromo- 
somes 17 and 14 to be associated with a large number 
of hypermethylated genes in Stages I and III. Chromo- 
some 17 has been previously studied and associated 
with NSCLC [90]. Chromosome 14 has been associated 
with genetic variation in lung cancer [91]. In Stage III 



chromosome, 10 was identified with nine hypermethy- 
lated genes and these were: LBX1, NKX6-2, PTF1A, 
SLC18A3, SORC3, SPAG6, C10orf26, and C10orf82. The 
ladybird homeobox 1 (LBX1) gene has been associated 
with the breakpoint regions involved in T-cell leukemia 
[92] and methylated in prostate cancer [93]. However, 
not much has been studied and reported about LBX1 
methylation and association in lung cancer. Similarly 
NKX6-2 is a methylated biomarker for bladder cancer 
but its importance and methylation has not been stud- 
ied in LUAD [94]. 

The analysis found that chromosomes with the highest 
number of hypermethylated genes in Stage I also had 
the highest number of hypomethylated genes in Stage II 
and eventually have the highest number of hypermethy- 
lated genes in Stage III (Figure 3 A and Figure 3B). Also 
in Stage II a large number of hypomethylated genes 
were identified in almost all the chromosomes. This ob- 
servation suggests a distinct methylation pattern across 
the three stages of LUAD, and since methylated genes are 
present on certain chromosomes in cancer, further indi- 
cating that epigenetics plays an important role in LUAD. 

The pathway analysis depicted the onset of differ- 
ent epigenetically modified pathways across stages. 
The common signaling pathway identified across the 
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three stages shown in Additional file 2 was: Adipocy- 
tokine signaling and Phosphatidylinositol signaling; 
across Stages I and II were: Toll-like receptor, Cal- 
cium signaling, GnRH signaling pathway; across 
Stages II and III were: JAK-STAT and PS3 signaling 
pathways. It has been reported that genes silenced due to 
promoter methylation were mostly tumor suppressor 
genes [15], and silencing of these genes can eventually 
affect all the pathways especially cell cycle, DNA repair 
genes, apoptosis, signaling etc., which could lead to tumor 
progression. An example of this propagation was gene 
LY96 which was identified in our analysis as hyper- 
methylated in Stage I and hypomethylated in Stage II 
(Table 2). In Stage I, the interacting genes were TLR2, 
TLR4, while in Stage II its interacting genes were TLR2, 
CALM1 and UBC. LY96 encodes MD2 a molecule im- 
portant for the activation of TLR4, which promotes sur- 
vival [95] and Toll-like pathways connect to the 
immune system [96]. DNA methylation of LY96 might 
prevent the activation of TLR4 in Stage II, which in turn 
would affect the activation of Toll-like pathway. Since 
cancer cells evade the immune system, reversing the 
epigenetic behavior of LY96 needs to be further evalu- 
ated as it could result in the activation of TLRs which 
would be beneficial. Similar analysis can be carried out 
for the other pathways that are common across stages 
as these have also been identified as important cancer 
signaling pathways [74,97]. In addition, the focal adhe- 
sion pathway associated with Stage II has been reported 
to be involved with multiple signaling events in lung 
cancer, suggesting that methylation of this pathway 
might also affect the signaling pathways [98]. Hedgehog 
pathways in Stage III have been identified as a subset 
of NSCLC and are being investigated for clinical trials 
[99]. Our analysis also found that metabolic pathways 
are co-related with the DNA methylated genes in each 
stage, underscoring the fact that methylation affects im- 
portant pathways in LUAD. Our analysis also depicted the 
early and late methylated affected pathways. This analysis 
demonstrates that targeting the epigenetic genes in these 
pathways might be effective for LUAD. 

Understanding the DNA methylated stage-specific networks 

Effective drug target identification in a disease now 
requires incorporating knowledge of the epigenetic genes 
with knowledge of other biological features. Biological 
networks help understand and elucidate the roles of 
the molecular entities individually and collectively. 
Therefore, the epigenetically modified genes identified 
in our study were further analyzed in terms of their 
interaction partners across different stages of LUAD. 
This network analysis can help to recognize patterns 
that were not visible by exploring the expression data 
alone and help to illustrate the conserved and unique 



patterns across the stages of LUAD. These patterns 
could then be further validated in laboratories for their 
efficacy as drug targets. 

The missing links and novel genes of Table 6 were 
identified in subsequent or other LUAD stages. These 
novel genes were analyzed for their interacting partners. 
Table 7 shows the common Significant DNA methylated 
genes across all stages, having interactions with these 
novel genes (missing links), were: A] API, FOXG1, GRIK3, 
HAND2, HOXD4, PHOX2A and PRKC. Most of these 
DNA methylated genes were analyzed for their signifi- 
cance in the previous sections. Analyzing the associa- 
tions of the novel genes identified the following TFs: 
c-Jun, SMAD1, STAT3, and others genes like EGFR, 
BCR, SUMOl, CALM1, CULS, CTNNB1 etc. The TFs 
identified as novel for a given stage play an important 
role in cancers [100]. C-Jun was identified as important 
TF in cancer and its subnetwork has been identified in 
Stages II and III; which was discussed in the previous 
section. EGFR mutations were associated with NSCLC 
(www.egfr.org). CALM1 and CTNNB1 were studied in 
NSCLC and lung cancer [101,102]. This brief analysis 
elucidates that novel genes interacting with epigenetic genes 
can play an important role in LUAD; further highlight- 
ing that it is essential to understand stage-specific networks. 

To understand the commonality and uniqueness of 
the Significant DNA methylated genes in the context of 
the other significant expressed genes, we developed and 
performed subnetwork analysis (as described in the meth- 
odology section). Subnetworks of each size were analyzed 
with respect to their hub genes. Table 8 shows the hub 
gene profile across different stages in the size four sub- 
networks. From this table, it can be seen that UBC and 
CUL1 were hub genes across top ranked stage-specific 
subnetwork; COPB1, FOXH1, SMAD3, TLR4 for Stages I; 
HDAC1 for Stage II. Also SIRT7, SUM02, LY96, c-Jun 
were hub genes in different stages. Additionally each sub- 
network had at least one TF. This analysis also confirms 
that epigenetic genes are not usually hub genes, but have 
a direct correlation with TF. Also TFs are usually the 
hub nodes and play an important role in cancers [100], 
meaning that targeting DNA methylated genes is advanta- 
geous as it would not disrupt the whole network but can 
induce the necessary changes to restore the functionality. 

Analysis of the genes in the subnetworks (Additional 
file 4) found FOXG1 and PHOX2A (Table 2) to be com- 
mon across all pathway classes. FOXG1 is already a sig- 
nature gene for lung cancer [68]. All these subnetworks 
consisted of at least one TF and these were: HAND2, 
MYC, SMAD2, SMAD3, SMAD4, and TPS3; which are 
important in cancer [70,100,103-105]. The other genes 
AR } ATF2 } CUL1, EP300, GATA4 } LEF1, SKP2 in these 
subnetworks have also been identified as important in 
lung cancer [106-112]. Our analysis identified the highly 
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conserved common subnetworks of GRIK2, GRIK3, GRIK5, 
and GRID2 in the metabolic pathway class. GRIK3 has 
been reported to be associated with breast cancer and is 
also in consideration for diagnostic value in lung cancer 
[56]. In addition, the analysis also identified some sub- 
networks with novel genes (see Table 6). This analysis 
suggests epigenetic genes can be used to target lung 
cancer genes and identification of epigenetic subnet- 
works can aid in stage- wise characterization of LUAD. 

The top ranked subnetworks of size four in each stage 
were propagated based on their SubnetworkStrength to 
identify the largest conserved subnetwork across stages. 
Analysis of the different subnetworks found that a set of 
seven genes was conserved across the stages. The size of 
the subnetworks with these conserved genes was 11, 
and the seven genes were: UBC, KRAS, PIK3CA, PIK3R3, 
RAF1, BRAF, and RAP1A. From Table 8, it was seen that 
UBC was the hub gene in all stages. UBC is considered to 
be a reference gene for lung cancer, though it interacts 
with important cancer genes like EGFR, PCNA, IRAKI, 
and PS3 [113,114]. Since UBC is involved in ubiquitina- 
tion, it is responsible for cell death and general mainten- 
ance. In this analysis, UBC expression was found in all 
stages, suggesting that its function if disrupted can result 
in uncontrolled cell division, a key feature of cancer. 
The BRAF gene encodes a RAS regulated kinase that 
mediates cell growth. Recent studies have identified 
BRAF mutations in NSCLC [115]. Phosphatidylinositol 
3 '-kinase (PI3K) is a heterodimer that consists of cata- 
lytic and regulatory subunits. PIK3CA is one of the cata- 
lytic subunit genes and PIK3R3 is one of the regulatory 
subunit genes: both of these genes were present in the 
conserved subnetwork. PIK3CA mutations have been 
identified in many cancers. The PIK3CA pathway con- 
sist of the KRAS and EGFR genes which are important 
targets for many cancers [100], mutations of PIK3CA 
have been also identified in lung cancer [116]. PIK3R3 
expressions have been associated with cancers like glio- 
blastoma and ovarian cancer in prior studies [117], and 
recent studies have identified PIK3R3/AKT as the target 
of lung cancer molecule miR-7 which affects TLR9 sig- 
naling (TLR was discussed in a previous section) [118]. 
This analysis of the conserved seven genes in the DNA 
methylated genes subnetworks of size 1 1 elucidated that 
methylation can affect important LUAD genes and enrich- 
ment analysis described the important biological processes 
associated with these subnetworks. From Table 9, it can 
be seen that these subnetworks affect the important sig- 
naling and metabolic cancer processes. Additional file 4 
list the DNA methylated genes associated with these 
subnetworks. Therefore, it can be concluded that further 
laboratory validation of epigenetic genes in these con- 
served subnetworks might be useful in recognizing a 
novel target of LUAD that can be universal to all stages. 



Effectiveness of methodology to extract significant sub- 
networks for networks obtained using variable dataset 
sources is proved by the comparison results showed in 
Table 10 and Figure 6. In addition, the same set of con- 
served genes was identified by the algorithm which proves 
the robustness of the analysis pipeline. 

Conclusions 

The study was entirely based on the available TCGA 
data, which has the limitation of unequal samples; still 
we were able to prove the advantage of integrating epi- 
genetic data, expression data and protein-protein inter- 
action knowledge for advancing of systematic understanding 
of LUAD. This understanding can be further improved by 
incorporating the system biology approach to the epigenetic 
profile across the different stages of LUAD. The study 
identified 72, 93 and 170 epigenetic genes across Stages I, 
II and III. A set of 34 common epigenetic genes were 
identified across the three stages, and it was observed that 
methylation patterns were similar across Stages I and III, 
but were different in Stage II. The study also identified 
known, and novel epigenetic genes across stages that 
were important in LUAD, these genes could be further 
validated in the laboratory for their scope as targets. The 
novel epigenetic genes identified were PTGDR, POU4F2, 
TLX3, and MMP26 along with the study identified early 
and late expression profiles of NEUROG1, A] API, and 
COR06 in LUAD. System biology approach stated that 
epigenetic genes were not the hub nodes but could still 
affect the hub genes in the networks, eventually playing 
a critical role in the disease mechanism. Subnetworks 
of size 11 with seven conserved genes across the three 
stages were literature validated, confirming their im- 
portance in LUAD. Therefore, it can be concluded that 
integrating methylated data with expression data can 
be useful for comprehending in-depth disease mechanism 
and for the ultimate goal of better target identification. 

Methods 

The gene expression and DNA methylation data for LUAD 
were downloaded from TCGA [119]. The gene expression 
data were generated by UNC AgilentG4502A_07_3, 
and the methylation profiles were generated by Illumina 
HumanMethylation27 DNA Analysis which contains 
27,578 CpG dinucleotides in 14,495 genes. These data- 
sets were downloaded on 10-12-2012 and segregated 
with respect to the stages in LUAD. The protein-protein 
interaction was downloaded from BioGRID [34]. The data- 
set from BioGRID comprised of 15,550 proteins and 86,344 
interactions. In addition to protein-protein interactions, 
manually curated human signaling network [120-123] was 
used to verify the effectiveness of the analysis pipeline. 
The signaling network consisted of -6,300 proteins and 
63,000 signaling relations (http://www.bri.nrc.ca/wang/). 
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The overall methodology for the stage-wise identifica- 
tion of LUAD process is shown in Figure 7, and it in- 
cludes four steps (A-D) as given below; 

Step A: the gene expression data from UNC 

AgilentG4502A_07_3 were analyzed based on 
the log 2 values to obtain the differentially 
expressed genes. 

Step B: the methylation data from Illumina 

HumanMethylation27 for each stage were 
analyzed based on the beta value to obtain the 
differentially methylated genes. 

Step C: the data obtained from Step A and B was 

integrated to obtain a stage-specific network 
of LUAD. This network was annotated with 
the topological and biological features for 
analyzing the methylated patterns. 



Step D: the stage-specific subnetworks were obtained 
for LUAD. 

Following sections contain details of each step. 

Step A. Identification of significant genes from 
expression data 

The level 3 data available from TCGA [119] was segre- 
gated based on the stages provided in Metadata. If a gene 
was logi > 1.4 or < - 1.4, then the gene was considered for 
further analysis as it obeyed the stringency with respect to 
fold change > 2.5 (a log 2 ratio of 1 represents a 2-fold 
change) [124]. These genes were termed as "Significant 
expressed genes". The average value for each of these genes 
was then computed and considered for the next level ana- 
lysis. If a gene was represented by two or more probes, then 
the median of its expression value was used. 



0 mpdp w t I MM) MM 




Figure 7 Overall methodology. (A) UNC AgilentG4502A_07_03 gene expressed data was analyzed based on the log 2 values to obtain the 
differentially expressed genes. (B) The methylation data from Illumina HumanMethylation27 were classified for each stage. Significantly DNA 
methylated genes were identified. (C) Stage-specific interaction networks were constructed and annotated with their DNA methylated genes. The 
nodes and edges of each network were scored based on the topological and biological features. (D) The epigenetic subnetworks were identified 
and compared across stages to understand the epigenetics commonality and uniqueness. 
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Step B.I. Identification of significant DNA methylated 
genes from methylated data 

The beta-values [125], for normal and disease samples 
were downloaded from the TCGA for Illumnia Human- 
Methylation27 and stratified across stages. The difference 
between the normal and the disease beta-values were then 
calculated. Genes with beta-values greater than 0.25 and 
those with beta-values less than -0.25 were considered for 
further analysis and were classified as hyper and hypo- 
methylated [125]. For the study, the analysis of q-value 
and 1% FDR gave threshold for the p-values obtained in 
respective stages [32]. The threshold was then used to 
filter the data and significant DNA methylated genes 
were derived using the Mann- Whitney U test [126], 
p-values were computed for each gene. Mann- Whitney 
U test was considered as it can handle variance for 
unequal sample sizes [127]. These genes were termed as 
"Significant DNA methylated genes". Since the sample 
sizes were small, to get true inferences resampling tech- 
nique was performed. The samples were permuted large 
number of times (1000), and Mann Whitney Test was 
performed on them [128] to get p-values [119]. 

Step B.2. Analysis of DNA methylated genes with respect 
to the CpG islands, promoter regions, transcription 
factors, chromosome distribution and pathways 

The correlation of DNA methylated genes with CpG islands 
is assessed by mapping the position of the DNA methylated 
gene to CpG position using computational methods [17]. 
For this study, the significant DNA methylated genes from 
each stage in LUAD as identified in the previous steps, were 
mapped to the CpG islands provided by Illumnia Human- 
Methylation27. These were then classified as TRUE or 
FALSE based on their location inside or outside of the CpG 
islands. The CpG islands were then correlated to the 
promoter region by computing the distance between the 
transcriptional start sites (TSS) (http://genome.ucsc. 
edu) and the promoter region. For this study, the pro- 
moter region was defined as -1500 to +500 bp around 
the TSS site [18]. The Significant DNA methylated genes 
were also analyzed for their transcription functions 
using Gene Ontology [121] and chromosome distribution 
(www.genome.ucsc.edu). This analysis gave the profile of 
Significant DNA methylated genes as TFs and their 
chromosome distribution. To understand the stage-wise 
profile of pathways consisting of Significant DNA meth- 
ylated genes, these were then annotated with respect to 
their pathway association using KEGG [74,129]. This ana- 
lysis found common and unique pathways across stages. 

Step C. Understanding the stage-specific networks of LUAD 

To understand the significance of the DNA methylated 
genes in LUAD, stage-specific networks were obtained 
using the following steps: 



Identification of gene-gene interactions and DNA 
methylated-gene interactions from BioGRID and constructing 
the stage-specific networks of LUAD 

The gene-gene physical interactions (associations) for all 
the Significant expressed genes and Significant DNA 
methylated genes were identified using BioGRID for all 
stages [34] . The networks for each stage were constructed 
based on these interactions. The nodes of the network 
were genes and the interactions between them were the 
edges. The nodes and edges were then analyzed to capture 
the commonality and differences across the stages. These 
were computed based on the following criteria: (i) Identi- 
fying edges (interaction) between two Significant DNA 
methylated genes (nodes); (ii) Identifying edges (interaction) 
between Significant DNA methylated gene (node) and 
Significant expressed gene (node); (iii) Identifying edges 
(interaction) between the Significant DNA methylated 
gene (node) and another gene (node) other than the sig- 
nificant expressed and DNA methylated genes in the 
given stage. This interaction was termed as the "missing 
link" and the gene as "novel gene". The expression 
pattern of this novel gene was then evaluated in the 
previous or subsequent stages. The significance of novel 
genes with respect to LUAD was validated using Biomed- 
ical literature. 

To understand the overall profile of each stage-specific 
network of LUAD, a system s biology approach was de- 
veloped. All the nodes (genes) and edges (interactions) 
were annotated with their respective topological and bio- 
logical features. The statistical computing tool R (www. 
r-project.org) was used to compute the topological fea- 
tures of betweeness and clustering coefficient. The two bio- 
logical features considered for the analysis were: Pathway 
Significance Score and Gene Ontology Semantic Similarity 
Score. The Pathway Significance Score was based on the 
occurrence of the given gene in a pathway class. For the 
study, the KEGG pathways were classified in the three 
pathway classes and these were (i) the lung cancer path- 
ways, (ii) other cancer pathways (not lung cancer), (iii) other 
pathways [129]. Each node (gene) in the network was 
annotated with betweeness, clustering coefficient and 
Pathway Significance Score. These features were normalized 
individually and the average of these features was com- 
puted. This average was termed as NodeStrenght, given as: 

NodeStrengthy 

_ {Betweenness + Clustering coefficient + Pathway Significance Score) 
~ 3 ' 

(i) 

Betweenness of a gene v was defined as the inverse 
of the ratio of the total number of shortest paths from 
gene s to gene t given by a st to the number of total 
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paths passing through gene v (a st (v)) [130]. This was 
computed as: 



Betweenness (By et (v)) = ^ 



<r s t (v) 



(ii) 



Clustering coefficient (CJ was defined as a function 
based on the triplets of the genes in the network, where 
a triplet consisted of the three genes (nodes) connected 
by either two open or three closed undirected ties [131]. 
The clustering coefficient for the genes in the undirected 
graph (stage- specific network) was computed as: 

For a graph G=(V,E) consisting of vertices Fand a set of 
edges £, where e U j connects vertex v t with vertex Vj and the 
neighborhood N t for this vertex v t was defined as: 



Ni = {vj : eyeE}. 



(iii) 



And where k t represents the number of vertices in the 
neighborhood of N t . The clustering coefficient for this 
local graph was then computed as: 



Clustering coeficeint (C v 



2 \ {e jk : Vj,v k eN h e jk eE}\ 



ki (kt-l) 



(iv) 



Pathway sifnificance score 

The pathways associated with each nodes v (genes) were 
identified using KEGG [74,129], and Pathway Sifnificance 
Score was computed as; 



Pathway Significance Score v - 



f frequency of term 
\ Total frequency 



* 100 



*(Totalfrequency) strength ^ 



strength. 

(v) 



Where, Pathway Sifnificance Score determined the level 
of importance of a gene in the lung cancer pathways, other 
cancer pathways (not lung cancer pathways) and other 
pathways (i.e. pathways that are not termed as lung cancer 
pathways or non-lung cancer pathways) as given by KEGG 
pathways; frequency of terms equaled the count of the gene 
in lung cancer pathways, other cancer pathways and other 
pathways; Total frequency was equal to the count of the 
lung cancer pathways, other cancer pathways and other 
pathways; Strength represents the rank of the pathway 
class to which the gene belongs to in the stage-specific 
network. For all the stage-specific network lung cancer 
pathway was given a rank of 3, other cancer pathways 
were given a rank of 2 and other pathways were given a 



rank of 1, of which 3 being the rank of the highest im- 
portance followed by 2 and 1 being the lowest rank. 

EdgeStrength 

For any two interacting nodes (genes) in the network, 
EdgeStrengthwzs computed based on their Gene Ontology 
Semantic Similarity Score. This was calculated using the 
GOSemSim package R [132]. 

All the genes and their edges in the stage-specific net- 
work were then annotated with their NodeStrength and 
EdgeStrength. The Significant DNA methylated genes were 
ranked based on their NodeStrength. The highly ranked 
DNA methylated genes were used to identify subnetworks 
as described in the following section. 

Step D. Identification and scoring of epigenetically 
relevant subnetworks across stages 

To compare and elucidate the interaction network of 
Significant DNA methylated genes across stages is a hard 
problem. Therefore, the networks were analyzed using 
graph techniques by identifying the relevant subnetworks 
[133-135]. In this work, subnetworks of different sizes 
were identified and analyzed across the stages to under- 
stand the functional importance of the Significant DNA 
methylated genes. For the study, we define a subnetwork 
as a group of connected nodes (genes) with at least one 
Significant DNA methylated gene, where any two asso- 
ciated genes had the Gene Ontology Semantic Score > 
60%. These were open subnetworks i.e. no size and 
shape limitation; therefore a large number of subnet- 
works were identified making it an NP-hard problem. 
Starting with the top ranked Significant DNA methylated 
gene as a seed, its associations were identified, propagated 
based on Ontology Semantic Similarity Score >60%, and 
analyzed with respect to the KEGG pathways. All the 
genes in a given subnetwork were understood based on 
the four categories: (i) genes identified in cancer pathways 
other than lung cancer pathways, (ii) genes identified in 
lung cancer pathways, (iii) genes identified in signaling 
pathways (not present in (i) and (ii)) and (iv) genes in the 
metabolic pathways and other pathways. These subnet- 
works correlate to distinct functions that specify the dis- 
tinct mechanism that were compared across the stages. 

The Significant DNA methylated genes in each stage 
were ranked based on their beta-value. The Significant 
DNA methylated gene with the highest beta-value was 
considered as a SEED. The SEED and expand algorithm 
was then used to identify the next connecting gene (node) 
and interaction (edge) based on the NodeStrength and 
EdgeStrength. The gene (node) with highest NodeStrength 
was considered as the next gene (node) if it satisfied 
the Gene Ontology Semantic Score > 60% for the Edge- 
Strength. Thus, subnetworks of different sizes were 
identified and connected in each of the stage-specific 
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network and ranked based on their SubnetworkStrength 
which was computed as; 

SubnetworkStrength 

y^,_^ C (NodeStrengtti) + 1 EdgeStrength 

Number of Genes 

(vi) 

Where, i are genes (nodes), / are interactions (edges), 
and k is the number of genes (nodes). 

The subnetworks were compared for their commonality 
and uniqueness across stages to identify those Significant 
DNA methylated genes that could be potential targets. 
These were then validated using literature for their im- 
portance in LUAD. In order to prove the universal nature 
of above detailed network analysis, the methodology was 
repeated for the interacting genes obtained from manually 
curated Human Signaling Network dataset (http://www. 
bri.nrc.ca/wang/). 

Additional files 



Additional file 1: p-value profile of original and corrected 
resampling data for stage I. 

Additional file 2: Pathway distribution for DNA methylated gene 
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UBC network. 



Abbreviations 

TFs: Transcription factors; NSCLC: Non-small cell lung cancer; SCLC: Small cell 
lung cancer; LUAD: Lung adenocarcinoma. 

Competing interests 

The authors declare that they have no competing interests. 
Authors' contributions 

MPP: conceptualizing and developing methodology, analysis of all the 
algorithm result, writing manuscript. AAD: data collection and analysis, 
scripting, figures and input for writing. MJP: PI of the project, conceptualizing 
the objective, writing manuscript, valuable inputs at all the time. All authors 
read and approved the final manuscript. 

Authors' information 

Akshay Desai and Meeta Pradhan are co-first authors. 
Acknowledgements 

We would like to thank Maya Palakal and Deepali Jhamb for editing the 
manuscript. We also want to thank all the members of the TiMAP laboratory 
at IUPUI for their valuable suggestions. 

Received: 1 May 2013 Accepted: 16 December 2013 
Published: 26 December 2013 

References 

1. Garnis C, Buys TP, Lam WL: Genetic alteration and gene expression 
modulation during cancer progression. Mol Cancer 2004, 3:9. 



2. Steen HB: The origin of oncogenic mutations: where is the primary 
damage? Carcinogenesis 2000, 21:1773-1776. 

3. Liu H, Su JZ, Li JH, Liu HB, Lv J, Li BY, Qiao H, Zhang Y: Prioritizing 
cancer-related genes with aberrant methylation based on a weighted 
protein-protein interaction network. Bmc Syst Biol 201 1, 5:158. 

4. Levenson V, Melnikov A: DNA Methylation as Clinically Useful 
Biomarkers-Light at the End of the Tunnel. Pharmaceuticals 2012, 5:1. 

5. Lauss M, Aine M, Sjodahl G, Veerla S, Patschan 0, Gudjonsson S, Chebil G, 
Lovgren K, Ferno M, Mansson W, et al: DNA methylation analyses of 
urothelial carcinoma reveal distinct epigenetic subtypes and an 
association between gene copy number and methylation status. 
Epigenetics-Us 2012, 7:858-867. 

6. Lund AH, van Lohuizen M: Epigenetics and cancer. Genes Dev 2004, 
18:2315-2335. 

7. Bock C: Analysing and interpreting DNA methylation data. Nat Rev Genet 
2012, 13:705-719. 

8. Esteller M: Epigenetics in Cancer. N Engl J Med 2008, 358:1 148-1 159. 

9. Brena RM, Costello JF: Genome-epigenome interactions in cancer. Hum Mol 
Genet 2007, 16:R96-R105. 

10. Esteller M: Cancer genomics: DNA methylomes and histone-modification 
maps. Nat Rev Genet 2007, 8:286-298. 

11. Rodenhiser D, Mann M: Epigenetics and human disease: translating basic 
biology into clinical applications. Can Med Assoc J 2006, 174:341-348. 

12. Du X, Han L, Guo AY, Zhao ZM: Features of methylation and gene 
expression in the promoter-associated CpG islands using human 
methylome data. Comp Funct Genom 2012, 2012:598987. 

13. Ponger L, Mouchiroud D: CpGProD: identifying CpG islands associated 
with transcription start sites in large genomic mammalian sequences. 
Bioinformatics 2002, 18:631-633. 

14. Han L, Zhao Z: CpG islands or CpG clusters:how to identify functional 
GC-rich regions in a genome? BMC Bioinformatics 2009, 1 0:65. 

15. Esteller M: Epigenetic gene silencing in cancer: the DNA 
hypermethylome. Hum Mol Genet 2007, 16:R50-R59. 

16. Chen P-Y, Feng- S, Joo JWJ, Jacobsen SE, Pellegrini M: A comparative 
analysis of DNA methylation across human embryonic stem cell lines. 
Genome Biology 2011, 12:R62. 

17. Wrzodek C, Buchel F, Hinselmann G, Eichner J, Mittag F, Zell A: Linking the 
epigenome to the genome: correlation of different features to DNA 
methylation of CpG islands. PLoS One 2012, 7:e35327. 

18. Vincent A, Omura N, Hong SM, Jaffe A, Eshleman J, Goggins M: Genome- 
wide analysis of promoter methylation associated with gene expression 
profile in pancreatic adenocarcinoma. Clin Cancer Res 2011, 17:4341-4354. 

1 9. Baylin SB, Herman JG: DNA hypermethylation in tumorigenesis - epigenetics 
joins genetics. Trends Genet 2000, 16:168-1 74. 

20. Xie L, Weichel B, Ohm JE, Zhang K: An integrative analysis of DNA 
methylation and RNA-Seq data for human heart, kidney and liver. 
Bmc Syst Biol 201 1, 5(Suppl 3):S4. 

21. Buness A, Kuner R, Ruschhaupt M, Poustka A, Sultmann H, Tresch A: 
Identification of aberrant chromosomal regions from gene expression 
microarray studies applied to human breast cancer. Bioinformatics 2007, 
23:2273-2280. 

22. Holger Heyn H, Carmona F, Gomez A, Ferreira H, Bell T, Sayols S, Ward K, 
Stefansson O, Moran S, Sandoval J, et al: DNA methylation profiling in 
breast cancer discordant identical twins identifies DOK7 as novel 
epigenetic biomarker. Carcinogenesis 2013, 34:102-108. 

23. Esteller M, Silva JM, Dominguez G, Bonilla F, Matias-Guiu X, Lerma E, Bussaglia E, 
Prat J, Harkes IC, Repasky EA, et al: Promoter hypermethylation and 
BRCA1 inactivation in sporadic breast and ovarian tumors. J Natl Cancer I 
2000, 92:564-569. 

24. Carvalho RH, Haberle V, Hou J, van Gent T, Thongjuea S, van Ucken W, 
Kockx C, Brouwer R, Rijkers E, Sieuwerts A, et al: Genome-wide DNA 
methylation profiling of non-small cell lung carcinomas. Epigenet 
Chromatin 2012, 5:9. 

25. Singh P, Yang M, Dai HF, Yu DK, Huang Q, Tan W, Kernstine KH, Lin DX, 
Shen BH: Overexpression and hypomethylation of flap endonuclease 1 
gene in breast and other cancers. Mol Cancer Res 2008, 6:1 710-1 71 7. 

26. Sun W, Liu Y, Glazer CA, Shao C, Bhan S, Demokan S, Zhao M, Rudek MA, 
Ha PK, Califano JA: TKTL1 is activated by promoter hypomethylation and 
contributes to head and neck squamous cell carcinoma carcinogenesis 
through increased aerobic glycolysis and HIF1 alpha stabilization. Clin Cancer 
Res 2010, 16:857-866. 



Pradhan et al. BMC Systems Biology 2013, 7:141 
http://www.biomedcentral.eom/1 752-0509/7/1 41 



Page 19 of 21 



27. Lokk K, Vooder T, Kolde R, Valk K, Vosa U, Roosipuu R, Milani L, Fischer K, 
Koltsina M, Urgard E, et al: Methylation markers of early-stage non-small 
cell lung cancer. PLoS One 2012, 7:e39813. 

28. Notional Cancer Institute. [http://www.canceF.gov/canceftopics/types/lung] 

29. Shinjo K, Okamoto Y, An B, Yokoyama T, Takeuchi I, Fujii M, Osada H, Usami N, 
Hasegawa Y, Ito H, et al: Integrated analysis of genetic and epigenetic 
alterations reveals CpG island methylator phenotype associated with 
distinct clinical characters of lung adenocarcinoma. Carcinogenesis 2012, 
33:1277-1285. 

30. Heller G, Babinsky VN, Ziegler B, Weinzierl M, Noll C, Altenberger C, Mullauer L, 
Dekan G, Grin Y, Lang G, et al: Genome-wide CpG island methylation 
analyses in non-small cell lung cancer patients. Carcinogenesis 2013, 
34:513-521. 

31. Kwang-ll Goh K, Cusick ME, Valle D, Childs B, Vidal M, Baraba'si AL: The 
human disease network. Proc Natl Acad Sci USA 2007, 104:8685-8690. 

32. Storey JD: A direct approach to false discovery rates. J Roy Stat Soc B 
2002, 64:479-498. 

33. Suzuki MM, Bird A: DNA methylation landscapes: provocative insights 
from epigenomics. Nat Rev Genet 2008, 9:465-476. 

34. Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: 
a general repository for interaction datasets. Nucleic Acids Res 2006, 
34(suppl 1):D535-D539. 

35. Reimand J, Kull M, Peterson H, Hansen J, Vilo J: g:Profiler - a web-based 
toolset for functional profiling of gene lists from large-scale experiments. 
Nucleic Acids Res 2007, 35(suppl 2):W193-W200. 

36. Reimand J, Arak T, Vilo J: g:Profiler-a web server for functional interpretation 
of gene lists (201 1 update). Nucleic Acids Res 201 1, 39(suppl 2):W307-W315. 

37. Gray S, Pandha HS, Michael A, Middleton G, Morgan R: HOX genes in 
pancreatic development and cancer. JOP 201 1, 12:216-219. 

38. Cheng W, Liu L, Yoshida H, Rosen D, Naora H: Lineage infidelity of epithelial 
ovarian cancers is controlled by HOX genes that specify regional identity in 
the reproductive tract. Nat Med 2005, 1 1 (5)537. Epub 2005, Apr 1 0. 

39. Hwang SH, Kim KU, Kim JE, Kim HH, Lee MK, Lee CH, Lee SY, Oh T, An S: 
Detection of HOXA9 gene methylation in tumor tissues and induced 
sputum samples from primary lung cancer patients. Clin Chem Lab Med 
2011,49:699-704. 

40. Rauch T, Wang ZD, Zhang XM, Zhong XY, Wu XW, Lau SK, Kernstine KH, 
Riggs AD, Pfeifer GP: Homeobox gene methylation in lung cancer studied 
by genome-wide analysis with a microarray-based methylated CpG island 
recovery assay. Proc Natl Acad Sci USA 2007, 104:5527-5532. 

41 . Charlesworth JC, Curran JE, Johnson MP, Goring HH, Dyer TD, Diego VP, 
Kent JW, Mahaney MC, Almasy L, MacCluer JW, et al: Transcriptomic 
epidemiology of smoking: the effect of smoking on gene expression in 
lymphocytes. Bmc Med Genomics 2010, 3:29. 

42. Spisak S, Kalmar A, Galamb O, Wichmann B, Sipos F, Peterfia B, Csabai I, 
Kovalszky I, Semsey S, Tulassay Z, et al: Genome-wide screening of genes 
regulated by DNA methylation in colon cancer development. PLoS One 
2012, 7:e46215. 

43. Vilas-Zornoza A, Agirre X, Martin-Palanco V, Martin-Subero Jl, San Jose-Eneriz E, 
Garate L, Alvarez S, Miranda E, Rodriguez-Otero P, Rifon J, etal: Frequent and 
simultaneous epigenetic inactivation of TP53 pathway genes in acute 
lymphoblastic leukemia. PLoS One 201 1 , 6:e1 701 2. 

44. Hartmann O, Spyratos F, Harbeck N, Dietrich D, Fassbender A, Schmitt M, 
Eppenberger-Castori S, Vuaroqueaux V, Lerebours F, Welzel K, et al: DNA 
methylation markers predict outcome in node-positive, estrogen 
receptor-positive breast cancer with adjuvant anthracycline-based 
chemotherapy. Clin Cancer Res 2009, 15:315-323. 

45. Davis E, Teng H, Bilican B, Parker Ml, Liu B, Carriera S, Goding CR, Prince S: 
Ectopic Tbx2 expression results in polyploidy and cisplatin resistance. 
Oncogene 2008, 27:976-984. 

46. Geng JF, Sun JF, Lin Q, Gu J, Zhao YX, Zhang HY, Feng X, He YH, Wang W, 
Zhou XY, et al: Methylation status of NEUROG2 and NID2 improves the 
diagnosis of stage I NSCLC. Oncol Lett 2012, 3:901-906. 

47. Rauch TA, Wang ZD, Wu XW, Kernstine KH, Riggs AD, Pfeifer GP: DNA 
methylation biomarkers for lung cancer. Tumor Biol 2012, 33:287-296. 

48. Zhang Y, Zhao H, Wang YL, Lin Y, Tan Y, Fang XX, Zheng LW: Non-small 
cell lung cancer invasion and metastasis promoted by MMP-26. Mol Med 
Rep 2011,4:1201-1209. 

49. Rebhan M, ChalifaCaspi V, Prilusky J, Lancet D: GeneCards: Integrating 
information about genes, proteins and diseases. Trends Genet 1997, 
13:163-163. 



50. Lopez-Lago MA, Thodima VJ, Guttapalli A, Chan T, Heguy A, Molina AM, 
Reuter VE, Motzer RJ, Chaganti RSK: Genomic deregulation during 
metastasis of renal cell carcinoma implements a myofibroblast-like 
program of gene expression. Cancer Res 2010, 70:9682-9692. 

51. Kwon YJ, Lee SJ, Koh JS, Kim SH, Lee HW, Kang MC, Bae JB, Kim YJ, Park JH: 
Genome-Wide Analysis of DNA Methylation and the Gene Expression 
Change in Lung Cancer. J Thorac Oncol 2012, 7:20-33. 

52. Snoussi K, Strosberg AD, Bouaouina N, Ben Ahmed S, Helal AN, 
Chouchane L: Leptin and leptin receptor polymorphisms are 
associated with increased risk and poor prognosis of breast carcinoma. 
Bmc Cancer 2006, 6:38. 

53. Ketterer K, Kong B, Frank D, Giese NA, Bauer A, Hoheisel J, Korc M, Kleeff J, 
Michalski CW, Friess H: Neuromedin U is overexpressed in pancreatic cancer 
and increases invasiveness via the hepatocyte growth factor c-Met 
pathway. Cancer Lett 2009, 277:72-81 . 

54. Lin N, Di C, Bortoff K, Fu J, Truszkowski P, Killela P, Duncan C, McLendon R, 
Bigner D, Gregory S, et al: Deletion or epigenetic silencing of AJAP1 on 
1p36 in glioblastoma. Mol Cancer Res 2012, 10:208-217. 

55. Dong J, Hu ZB, Wu C, Guo H, Zhou BS, Lv JC, Lu DR, Chen KX, Shi YY, 
Chu MJ, et al: Association analyses identify multiple new lung cancer 
susceptibility loci and their interactions with smoking in the Chinese 
population. Nat Genet 2012, 44:895-899. 

56. Mosca E, Alfieri R, Merelli I, Viti F, Calabria A, Milanesi L: A multilevel data 
integration resource for breast cancer study. Bmc Syst Biol 2010, 4:76. 

57. Racz A, Brass N, Hofer M, Sybrecht GW, Remberger K, Meese EU: Gene 
amplification at chromosome 1pter-p33 including the genes PAX7 and 
EN01 in squamous cell lung carcinoma. Int J Oncol 2000, 17:67-73. 

58. Widschwendter M, Fiegl H, Egle D, Mueller-Holzner E, Spizzo G, Marth C, 
Weisenberger DJ, Campan M, Young J, Jacobs I, et al: Epigenetic stem cell 
signature in cancer. Nat Genet 2007, 39:157-158. 

59. Yin D, Jia Y, Yu Y, Brock MV, Herman JG, Han C, Su X, Liu Y, Guo M: SOX17 
methylation inhibits its antagonism of Wnt Signaling pathway in Lung 
cancer. Discov Med 2012, 14:33-40. 

60. Tseng RC, Hsieh FJ, Hsu HS, Wang YC: Minimal deletion regions in lung 
squamous cell carcinoma: association with abnormality of the DNA 
double-strand break repair genes and their applications on gene 
identification and prognostic biomarkers. Lung Cancer 2008, 59:332-339. 

61 . Kim YH, Lee HC, Kim SY, II Yeom Y, Ryu KJ, Min BH, Kim DH, Son HJ, Rhee PL, 
Kim JJ, et al: Epigenomic analysis of aberrantly methylated genes in 
colorectal cancer identifies genes commonly affected by epigenetic 
alterations. Ann Surg Oncol 201 1, 18:2338-2347. 

62. Morris MR, Ricketts CJ, Gentle D, McRonald F, Carli N, Khalili H, Brown M, 
Kishida T, Yao M, Banks RE, et al: Genome-wide methylation analysis 
identifies epigenetically inactivated candidate tumour suppressor genes 
in renal cell carcinoma. Oncogene 201 1, 30:1390-1401. 

63. Hofsli E, Wheeler TE, Langaas M, Laegreid A, Thommesen L: Identification of 
novel neuroendocrine-specific tumour genes. Brit J Cancer 2008, 99:1330-1339. 

64. Wu XW, Rauch TA, Zhong XY, Bennett WP, Latif F, Krex D, Pfeifer GP: CpG 
island hypermethylation in human astrocytomas. Cancer Res 2010, 
70:2718-2727. 

65. Gyongyosi E, Szalmas A, Ferenczi A, Konya J, Gergely L, Veress G: Effects of 
human papillomavirus (HPV) type 16 oncoproteins on the expression of 
involucrin in human keratinocytes. Virol J 2012, 9:36. 

66. Palacios D, Summerbell D, Rigby PWJ, Boyes J: Interplay between DNA 
methylation and transcription factor availability: implications for 
developmental activation of the mouse myogenin gene. Mol Cell Biol 
2010, 30:3805-3815. 

67. Irvine RA, Lin IG, Hsieh CL: DNA methylation has a local effect on 
transcription and histone acetylation. Mol Cell Biol 2002, 22:6689-6696. 

68. Watanabe T, Miura T, Degawa Y, Fujita Y, Inoue M, Kawaguchi M, Furihata C: 
Comparison of lung cancer cell lines representing four histopathological 
subtypes with gene expression profiling using quantitative real-time 
PCR. Cancer Cell Int 2010, 10:2. 

69. Kim TH, Jo SW, Lee YS, Kim YJ, Lee SC, Kim WJ, Yun SJ: Forkhead box O-class 
1 and Forkhead box G1 as Prognostic Markers for Bladder Cancer. J Korean 
Med Sci 2009, 24:3. 

70. Metodieva SN, Nikolova DN, Cherneva RV, Dimova II, Petrov DB, Toncheva DI: 
Expression analysis of angiogenesis-related genes in Bulgarian patients 
with early-stage non-small cell lung cancer. Tumori 201 1, 97:86-94. 

71. Kelly ZL, Michael A, Butler-Manuel S, Pandha HS, Morgan RGL: HOX genes 
in ovarian cancer. J Ovarian Res 201 1, 4:16. 



Pradhan et al. BMC Systems Biology 2013, 7:141 
http://www.biomedcentral.eom/1 752-0509/7/1 41 



Page 20 of 21 



72. Zhao Y, Zhou H, Ma K, Sun J, Feng X, Geng J, Gu J, Wang W, Zhang H, He Y, 93. 
et al: Abnormal methylation of seven genes and their associations with 
clinical characteristics in early stage non-small cell lung cancer. Int J Mol 94. 
Med 2013, 5:1211-1218. 

73. Cao M, Yie SM, Liu J, Ye SR, Xia D, Gao E: Plasma soluble HLA-G is a potential 
biomarker for diagnosis of colorectal, gastric, esophageal and lung cancer. 95. 
Tissue Antigens 201 1 , 78:1 20-1 28. 

74. Kaneisha M, Goto S, Sato Y, Furumichi M, Tanabe M: KEGG for integration 

and interpretation of large-scale molecular datasets. Nucleic Acids Res 96. 
2012, 40:D109-D114. 

75. Huang X, Du X, Li Y: The role of BCL1 1 B in hematological malignancy. 97. 

Exp Hematol Oncol 201 2, 1 :22. 

76. Guenin S, Mouallif M, Deplus R, Lampe X, Krusy N, Calonne E, Delbecque K, 
Kridelka F, Fuks F, Ennaji MM, et al: Aberrant promoter methylation and 
expression of UTF1 during cervical carcinogenesis. PLoS One 2012, 98. 
7:e42704. 

77. Okamoto J, Hirata T, Chen Z, Zhou HM, Mikami I, Li H, Beltran A, Johansson M, 
Coussens LM, Clement G, et al: EMX2 is epigenetically silenced and 

suppresses growth in human lung cancer. Oncogene 2010, 29:5969-5975. 99. 

78. Anderson AR, White P, Kaestnar KH, Sussel L: Identification of known and 
novel pancreas genes expressed downstream of Nkx2.2 during development. 

BMC Development Biology 2009, 9:65. doi:1 0.1 186/1 471 -21 3X-9-65. 100. 
PMID:2003319. 

79. McCabe M, Lee E, Vertino P: A multi-factorial signature of DNA sequence 

and polycomb binding predicts aberrant CpG Island methylation. Cancer Res 101. 
2009, 69:282-291. 

80. Lleras RA, Adrien LR, Smith RV, Brown B, Jivraj N, Keller C, Sarta C, Schlecht NF, 

Harris TM, Childs G, et al: Hypermethylation of a cluster of Kruppel-type zinc 1 02. 
finger protein genes on chromosome 19q13 in oropharyngeal squamous 
cell carcinoma. Am J Pathol 201 1 , 1 78:1 965-1 974. 

81. Steinmann K, Richter A, Dammann H: Epigenetic silencing of 
erythropoietin in human cancers. Genes Cancer 201 1, 2:65-73. 

82. van Vlodrop UH, Baldewijns MML, Smits KM, Schouten LJ, van Neste L, 103. 
van Criekinge W, van Poppel H, Lerut E, Schuebel KE, Ahuja N, et al: 

Prognostic Significance of Gremlinl (GREM1) promoter CpG Island 
hypermethylation in clear cell renal cell carcinoma. Am J Pathol 2010, 
176:575-584. 104. 

83. Bennett KL, Karpenko M, Lin MT, Claus R, Arab K, Dyckhoff G, Plinkert P, 
Herpel E, Smiraglia D, Plass C: Frequently methylated tumor suppressor 

genes in head and neck squamous cell carcinoma. Cancer Res 2008, 105. 
68:4494-4499. 

84. Bailey-Wilson JE, Amos CI, Pinney SM, Petersen GM, de Andrade M, Wiest JS, 106. 
Fain P, Schwartz AG, You M, Franklin W, et al: A major lung cancer 
susceptibility locus maps to chromosome 6q23-25. Am J Hum Genet 2004, 
75:460-474. 107. 

85. Wang Y, Broderick P, Matakidou A, Eisen T, Houlston RS: Chromosome 
15q25 (CHRNA3-CHRNA5) variation impacts indirectly on lung cancer 

risk. PLoS One 201 1 , 6:e1 9085. 1 08. 

86. Welsh J, Sapinoso L, Kern S, Brown D, Liu T, Bauskin A, Ward R, Hawkins N, 
Quinn D, Russell P, et al: Large-scale delineation of secreted protein 
biomarkers overexpressed in cancer tissue and serum. Proc Natl Acad Sci 109. 
USA 2003, 100:3410-3415. 

87. Yang SH: Gene amplifications at chromosome 7 of the human gastric 

cancer genome. Int J Mol Med 2007, 20:225-23 1 . 110. 

88. Cui J, Deubler DA, Rohr LR, Zhu XL, Maxwell TM, Changus JE, Brothman AR: 
Chromosome 7 abnormalities in prostate cancer detected by dual-color 
fluorescence in situ hybridization. Cancer Genet Cytogen 1998, 107:51-60. 111. 

89. Buckingham LE, Coon JS, Morrison LE, Jacobson KKB, Jewell SS, Kaiser KA, 
Mauer AM, Muzzafar T, Polowy C, Basu S, et al: The prognostic value of 
chromosome 7 polysomy in non-small cell lung cancer patients treated 

with gefitinib. J Thorac Oncol 2007, 2:41 4-422. 1 1 2. 

90. Tsuji T, Tagawa Y, Hisamatsu T, Nakamura S, Terada R, Sawai T, Yasutake T, 
Ayabe H: p53 alterations and chromosome 17 aberrations in non-small 
cell lung cancer. Gan to kagaku ryoho Cancer & Chemotherapy 1997, 

24(Suppl 2):263-268. 113. 

91. Kamnasaran D, Cox DW: Current status of human chromosome 14. J Med 
Genet 2002, 39:81-90. 

92. Jagla K, Dolle P, Mattei MG, Jagla T, Schuhbaur B, Dretzen G, Bellard F, 114. 
Bellard M: Mouse Lbx1 and Human Lbx1 define a novel mammalian 
homeobox gene family related to the drosophila lady bird genes. 

Mech Develop 1995, 53:345-356. 



Kalari S, Pfeifer GP: Identification of driver and passenger DNA methylation 
in cancer by epigenomic analysis. Adv Genet 2010, 70:277-308. 
Chung W, Bondaruk J, Jelinek J, Lotan Y, Liang SD, Czerniak B, Issa JPJ: 
Detection of bladder cancer using novel DNA methylation biomarkers in 
urine sediments. Cancer Epidem Biomar 201 1, 20:1483-1491. 
Choi K, Creighton CJ, Stivers D, Fujimoto N, Kurie JM: Transcriptional 
profiling of non-small cell lung cancer cells with activating EGFR somatic 
mutations. PLoS One 2007, 2:e1226. 

Pinto A, Morello S, Sorrentino R: Lung cancer and Toll-like receptors. 

Cancer Immunol Immun 201 1 , 60:1 21 1 -1 220. 

Antoon JW, Lai R, Struckhoff AP, Nitschke AM, Elliott S, Martin EC, Rhodes LV, 
Yoon NS, Salvo VA, Shan B, et al: Altered death receptor signaling promotes 
epithelial-to-mesenchymal transition and acquired chemoresistance. Sci Rep 

2012, 2:539. 

Mukhopadhyay NK, Gordon GJ, Chen CJ, Bueno R, Sugarbaker DJ, Jaklitsch MT: 
Activation of focal adhesion kinase in human lung cancer cells involves 
multiple and potentially parallel signaling events. J Cell Mol Med 2005, 
9:387-397. 

Savani M, Guo Y, Carbone DP, Csiki I: Sonic hedgehog pathway expression 
in non-small cell lung cancer. Therapeutic advances in medical oncology 
2012, 4:225-233. 

Pradhan MP, Prasad NKA, Palakal MJ: A systems biology approach to the 
global analysis of transcription factors in colorectal cancer. Bmc Cancer 

2012, 12:331. 

Makinoshima H, Ishii G, Kojima M, Fujii S, Higuchi Y, Kuwata T, Ochiai A: 
PTPRZ1 regulates calmodulin phosphorylation and tumor progression in 
small-cell lung carcinoma. Bmc Cancer 2012, 12:537. 
Shigemitsu K, Sekido Y, Usami N, Mori S, Sato M, Horio Y, Hasegawa Y, 
Bader SA, Gazdar AF, Minna JD, et al: Genetic alteration of the beta-catenin 
gene (CTNNB1) in human lung cancer and malignant mesothelioma and 
identification of a new 3p21.3 homozygous deletion. Oncogene 2001, 
20:4249-4257. 

Yanagisawa K, Uchida K, Nagatake M, Masuda A, Sugiyama M, Saito T, 

Yamaki K, Takahashi T, Osada H: Heterogeneities in the biological and 

biochemical functions of Smad2 and Smad4 mutants naturally occurring 

in human lung cancers. Oncogene 2000, 19:2305-2311. 

Mise N, Savai R, Yu H, Schwarz J, Kaminski N, Eickelberg O: Zyxin is a 

TGF-(3/Smad3 target gene that regulates lung cancer cell motility via 

integrin aSpl.JBiolChem 2012, 287:31393-31405. 

Mogi A, Kuwano H: TP53 mutations in nonsmall cell lung cancer. J Biomed 

Biotechnol 2011,2011:583929. 

Nguyen DX, Chiang AC, Zhang XHF, Kim JY, Kris MG, Ladanyi M, Gerald WL, 

Massague J: WNT/TCF signaling through LEF1 and HOXB9 mediates lung 

adenocarcinoma metastasis. Cell 2009, 138:51-62. 

Mikkonen L, Pihlajamaa P, Sahu B, Zhang FP, Janne OA: Androgen receptor 

and androgen-dependent gene expression in lung. Mol Cell Endocrinol 

2010,317:14-24. 

Castro IC, Breiling A, Luetkenhaus K, Ceteci F, Hausmann S, Kress S, Lyko F, 

Rudel T, Rapp UR: MYC-induced epigenetic activation of GATA4 in lung 

adenocarcinoma. Mol Cancer /?es 2013, 11:161-1 72. 

Yokoi S, Yasui K, Mori M, lizasa T, Fujisawa T, Inazawa J: Amplification and 

overexpression of SKP2 are associated with metastasis of non-small-cell 

lung cancers to lymph nodes. Am J Pathol 2004, 165:175-180. 

Arora H, Qureshi R, Park AK, Park WY: Coordinated regulation of ATF2 by 

miR-26b in gamma-irradiated lung cancer cells. PLoS One 201 1, 

6:e23802. 

Salon C, Brambilla E, Brambilla C, Lantuejoul S, Gazzeri S, Eymin B: Altered 
pattern of Cul-1 protein expression and neddylation in human lung 
tumours: relationships with CAND1 and cyclin E protein levels. 

J Pathol 2007, 213:303-310. 

Peifer M, Fernandez-Cuesta L, Sos ML, George J, Seidel D, Kasper LH, Plenker D, 
Leenders F, Sun R, Zander T, et al: Integrative genome analyses identify 
key somatic driver mutations of small-cell lung cancer. Nat Genet 201 2, 
44:1104-1110. 

Cheng WC, Chang CW, Chen CR, Tsai ML, Shu WY, Li CY, Hsu IC: 
Identification of reference genes across physiological states for qRT-PCR 
through microarray meta-analysis. PLoS One 201 1, 6:e1 7347. 
Nguewa PA, Agorreta J, Blanco D, Lozano MD, Gomez-Roman J, Sanchez BA, 
Valles I, Pajares MJ, Pio R, Rodriguez MJ, et al: Identification of Importin 8 
(IP08) as the most accurate reference gene for the clinicopathological 
analysis of lung specimens. Bmc Mol Biol 2008, 9:103. 



Pradhan et al. BMC Systems Biology 201 3, 7:1 41 Page 21 of 21 

http://www.biomedcentral.eom/1 752-0509/7/1 41 



115. 



116. 



117. 



119. 
120. 



121. 

122. 

123. 

124. 

125. 

126. 

127. 

128. 

129. 

130. 
131. 

132. 

133. 
134. 
135. 



Brose MS, Volpe P, Feldman M, Kumar M, Rishi I, Gerrero R, Einhorn E, Herlyn M, 
Minna J, Nicholson A, et al: BRAF and RAS mutations in human lung cancer 
and melanoma. Cancer Res 2002, 62:6997-7000. 

Okudela K, Suzuki M, Kageyama S, Bunai T, Nagura K, Igarashi H, Takamochi K, 
Suzuki K, Yamada T, Niwa H, et al: PIK3CA mutation and amplification in 
human lung cancer. Pathol Int 2007, 57:664-671 . 
Zhou J, Chen GB, Tang YC, Sinha RA, Wu Y, Yap CS, Wang G, Hu J, Xia X, 
Tan P, et al: Genetic and bioinformatic analyses of the expression and 
function of PI3K regulatory subunit PIK3R3 in an Asian patient gastric 
cancer library. Bmc Med Genomics 2012, 5:34. 

Xu L, Wen Z, Zhou Y, Liu Z, Li Q, Fei G, Luo J, Ren T: MicroRNA-7regulated 
TLR9 signaling enhanced growth and metastatic potential of human lung 
cancer cells by altering PIK3R3/Akt pathway, molbiolcellorg 2012, 24:42-55. 
Cancer Genome Atlas, http://cancergenome.nih.gov. 
Cui Q, Ma Y, Jaramillo M, Bari H, Awan A, Yang S, Zhang S, Liu L, Lu M, 
O'Connor-McCourt M: A map of human cancer signaling. Mol Syst Biol 
2006, 3:152. 

Awan A, Bari H, Yan F, Moksong S, Yang S, Chowdhury S, Cui Q, Yu Z, 

Purisima E, Wang E: Regulatory network motifs and hotspots of cancer genes 

in a mammalian cellular signalling network. IET Syst Biol 2007, 1:292-297. 

Li L, Tibiche C, Fu C, Kaneko T, Moran MF, Schiller MR, Li SS-C, Wang E: The 

human phosphotyrosine signaling network: Evolution and hotspots of 

hijacking in cancer. Genome Res 2012, 22:1222-1230. 

Newman RH, Hu J, Rho H-S, Xie Z, Woodard C, Neiswinger J, Cooper C, 

Shirley M, Clark HM, Hu S: Construction of human activity-based 

phosphorylation networks. Mol Syst Biol 201 3, 9:655. 

Chao C, Kay G, Judith B, Dandan Z, Elliot G, Li J, Liu C: Removing batch 

effects in analysis of expression microarray data: an evaluation of six 

batch adjustments methods. PLoS One 2010, 6:e17238. 

Du P, Zhang XA, Huang CC, Jafari N, Kibbe WA, Hou LF, Lin SM: Comparison 

of Beta-value and M-value methods for quantifying methylation levels by 

microarray analysis. BMC Bioinformatics 201 0, 1 1 :587. 

Kruskal WH: Historical notes on the Wilcoxon unpaired two-sample test. 

J Am Stat Assoc 1957, 52:356-360. 

Hollander M, Wolfe DA: Nonparametric Statistical Methods- Wiley Series in Probability 
and Statistics. 2nd edition. Wiley- Interscience; 1999. 978-0471 190455. 
Rindskopf D: An introduction to the bootstrap - Efron, B, Tibshirani, RJ. 

J Educ Behav Stat 1997, 22:245-245. 

Kaneisha M, Goto S: KEGG: kyoto enclyopedia of genes and genomes. 

Nucleic Acids Res 2000, 28:27-30. 

Newman MEJ: Networks: An Introduction. Oxford: Oxford University Press; 2010. 
Watts DJ, Strogatz SH: Collective dynamics of 'small-world' networks. 

Nature 1 998, 393:440-442. 

Yu GC, Li F, Qin YD, Bo XC, Wu YB, Wang SQ: GOSemSim: an R package for 
measuring semantic similarity among GO terms and gene products. 

Bioinformatics 2010, 26:976-978. 

Vidal M, Cusick ME, Barabasi AL: Interactome networks and human 
disease. Cell 2011, 144:6. 

Chuang HY, Lee E, Liu YT, Lee D, IdekerT: Network-based classification of 
breast cancer metastasis. Mol Syst Biol 2007, 3:140. 
MilenkovicT, Memisevic V, Bonato A, Przulj N: Dominating biological 
networks. PLoS One 201 1, 6:e23016. 



doi:1 0.1 186/1752-0509-7-141 

Cite this article as: Pradhan et al.: Systems biology approach to stage-wise 
characterization of epigenetic genes in lung adenocarcinoma. BMC Systems 
Biology 201 3 7:141. 



Submit your next manuscript to BioMed Central 
and take full advantage of: 

• Convenient online submission 

• Thorough peer review 

• No space constraints or color figure charges 

• Immediate publication on acceptance 

• Inclusion in PubMed, CAS, Scopus and Google Scholar 

• Research which is freely available for redistribution 



Submit your manuscript at 
www.biomedcentral.com/submit 



o 



BioMed Central 



